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Introduction 


The  general  aim  of  the  subject  contract  was  to  support  studies  in 
which  Fourier  representation  techniques  were  used  to  facilitate  ab 
initio  studies  of  the  electronic  structures  of  thin-film,  surface,  and 
interfacial  systems  without  the  approximations  ordinarily  introduced  to 
handle  the  exchange  energy.  The  work  is  reported  in  detail  in  the  eight 
technical  publications  produced  under  the  contract. 


Personnel 


The  work  under  the  contract  was  supervised  and  contributed  to  by 
Dr.  Frank  E.  Harris  (Professor  of  Physics  and  Chemistry)  and  Dr.  Hendrik 
J.  Monkhorst  (Research  Associate  Professor  of  Physics).  For  the  first 
two  years.  Dr.  Harris  was  the  principal  investigator;  for  the  third  and 
final  year  this  responsibility  was  shared  by  Drs.  Harris  and  Monkhorst. 
Major  amounts  of  work  were  also  performed  by  Dr.  William  A.  Schwalm 
(Postdoctoral  Research  Associate).  Effort  for  shorter  periods  of  time 
was  provided  by  Dr.  Tomislav  P.  Zivkovic,  Dr.  Bogumil  Jeziorski,  and  Mr. 
James  D.  Pack.  Dr.  K.  Szalewicz  collaborated  with  us  in  one  technical 
report. 


Narrative  Summary 


Much  of  the  work  under  the  contract  involved  the  resolution  of 
technical  questions  prerequisite  to  calculations  on  actual  systems. 

The  general  methods  to  be  used  are  described  in  technical  publications 
4,  5,  and  6  (vide  infra).  Evaluation  of  the  lattice  sums  describing 
the  long-range  electrostatic  interactions  are  reported  in  publications 
1  and  8.  Techniques  for  sums  over  states  are  discussed  in  publication 
2.  Publications  3  and  7  deal  with  questions  arising  out  of  the  math¬ 
ematics  of  the  analysis.  These  studies  complete  the  material  prerequi¬ 
site  to  actual  calculations;  a  study  of  hydrogen  films  (not  reported 
here)  has  been  completed  subsequent  to  the  expiration  of  this  contract. 


Technical  Publications 


1.  "Convergence  Acceleration  Techniques  for  Lattice  Sums  Arising  in 

Electronic  Structure  Studies  of  Crystalline  Solids".  F.  E ,  HarrisT 
J.  Math.  Phys.  18,  2377  (1977). 
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2.  "Special  Points  for  8ril1ouin-Zone  Integrations--A  Reply",  J.  D. 

Pack  and  H.  J .  Monkhorst,  Phys.  Rev.  B  1_6,  1748  (1  977). 

3.  "Analytic  Connection  between  Configuration-Interaction  and 

Coupled-Cluster  Methods",  T.  P.  Zivkovic  and  H.  J.  Monkhorst, 

J.  Math.  Phys.  T9,  1007  (1978). 

4.  "Hartree-Fock  Density-of-States  for  Extended  Systems",  H.  J. 

Monkhorst,  Phys.  Rev.  B  20,  1504  (1979). 

5.  “Exact  LCAO  Method  for  Two-Dimensional  Crystals  Using  Fourier 

Transform  Technique",  H.  J.  Monkhorst  (not  published). 

6.  "Hartree-Fock  Formalism  for  the  Calculation  of  Total  Energies  and 

Charge  Densities  of  Thin  Films",  F.  E.  Harris,  H.  J.  Monkhorst, 
and  W.  A.  Schwalm,  J.  Vac.  Sci  &  Tech.  J_6,  1318  (1979). 

7.  "Analytic  Continuation  in  Exchange  Perturbation  Theory",  B. 

Jeziorski,  W.  A.  Schwalm,  and  K.  Szalewicz,  J.  Chem.  Phys.  73, 
6215  (1  980). 

8.  "Electrostatics  for  Periodic  Films  of  Atoms",  H.  J.  Monkhorst  and 

W.  A.  Schwalm,  Phys.  Rev.  B  23,  1729  (1981  ). 
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Copies  of  technical  publications 


Convergence  acceleration  technique  for  lattice  sums  arising 
in  electronic-structure  studies  of  crystalline  solidsa> 

Frank  E.  Harris6' 

Dtpartment  of  Chtmistry.  Unittniry  of  Hawaii  at  Manoa,  2545  Tht  Mail  Honolulu.  Hawaii  96822 
(Received  24  June  1977) 

Slowly  convergent  lattice  summations  arise  when  at  initio  quantum-mechanical  studies  of  electronic 
structure  m  crystalline  solids  are  carried  out  by  Fourier  representation  methods.  Summations  of  this  type 
are  identified  and  discussed,  and  it  is  shown  how  a  technique  related  to,  but  not  identical  with,  that  of 
Ewald  can  be  used  to  accelerate  their  convergence.  The  presentation  is  illustrated  with  numerical 
examples. 


I.  introduction 

It  is  well  known  that  lattice  sums  of  electrostatic 
energy  contributions  (Madelung  sums)  converge  so 
slowly  that  convergence  acceleration  schemes  are  of 
great  importance.  This  observation  is  not  only  relevant 
for  the  conditionally  convergent  summations  describing 
the  potentials  of  charge  arrays,  but  also  applies  to 
systems  of  higher-order  multipoles  for  which  there 
are  no  formal  convergence  difficulties.  Illustrative  of 
the  problem  is  the  summation  of  r‘‘  over  the  nonzero 
points  of  a  simple  cubic  lattice.  To  obtain  this  sum  to 
five  significant  figures,  it  is  necessary  to  include  points 
out  to  approximately  r  =  10,  a  total  of  over  4,  000  points. 
Allowance  for  the  crystal  symmetry  would  reduce  the 
sum  to  that  of  approximately  150  inequivalent  points, 
but  in  actual  applications  the  distances  may  be 
measured  from  a  low- symmetry  point  or  occur  with  an 
offset  (e.  g. ,  I  r2  +  5’  I  "•).  We  see  that  from  a  practical 
viewpoint  the  convergence  difficulties  of  inverse  power 
summations  persist  to  surprisingly  high  powers. 

The  two  best-known  methods  for  accelerating  conver¬ 
gence  of  Madelung  sums  are  those  of  Evjen1  and  Ewald. 1 
The  Ejven  method  consists  of  grouping  together  the 
contributions  of  shells  of  points  in  such  a  way  that  the 
low-order  multipole  moments  of  each  shell  vanish.  The 
result  is  that  the  contributions  of  shells  fall  off  with 
increasing  distance  more  rapidly  than  do  the  contri¬ 
butions  of  individual  points.  We  shall  not  discuss  the 
Evjen  method  further  in  this  paper,  because  it  is  not 
directly  applicable  to  summations  where  all  points 
make  contributions  of  the  same  sign.  The  Ewald  method 
involves  the  introduction  of  an  integral  transform  for 
the  potential,  followed  by  a  division  of  the  transform 
integration  into  two  ranges,  each  of  which  is  then 
treated  separately.  From  one  integration  range  there 
emerges  a  summation  which  converges  more  rapidly 
than  the  original  sum.  For  the  other  integration  range, 
the  Poisson  summation  formula3  is  used  to  replace  the 
summands  by  their  Fourier  transforms,  after  which  that 
sum  also  becomes  rapidly  convergent. 

When  ab  initio  quantum-mechanical  studies  of  the 
electronic  structures  of  crystalline  solids  are  developed 
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in  a  Fourier  representation  formulation, 4  there  arise 
Madelung- type  summations  of  the  sorts  and  with  the 
problems  identified  above.  It  is  therefore  highly  desir¬ 
able  to  apply  convergence  acceleration  techniques  to 
such  summations,  but  they  differ  from  those  previously 
studied  to  an  extent  which  renders  impractical  the  usual 
acceleration  techniques.  The  purpose  of  this  paper  is  to 
present  and  illustrate  a  method  by  which  the  most  time- 
consuming  summations  in  ab  initio  quantum- mechanical 
calculations  on  solids  can  be  evaluated  more  accurately 
and  conveniently. 

In  succeeding  sections  of  this  paper  we  describe 
briefly  the  physical  problems  giving  rise  to  the  sum¬ 
mations  whose  evaluations  we  seek,  we  give  alternative 
(unaccelerated)  forms  of  these  sums,  and  we  present  an 
acceleration  technique  in  the  spirit  of,  but  not  identical 
with  that  of  Ewald.  Illustrative  results  indicate  the 
effectiveness  of  the  method. 

II.  PHYSICAL  BACKGROUND 

Fourier  representation  techniques  have  been  used  for 
the  evaluation  of  the  multicenter  integrals  arising  in 
electronic  structure  calculations. 5  In  such  approaches 
a  key  quantity  is  the  Fourier  transform  of  a  product 
of  atomic  orbitals.  For  example,  the  electron  repulsion 
integral  ( (P„i pc  irjjl  is  given  by  the  well-known 


formula 

<?*<?*>  =2^ 

/ exp(-  iq  •  Rac) 

*o.r* 

(q)«f„(-q). 

(1) 

with  Rte  =  Re-R,,  R,  the  center  for  orbital  and 

Ff*(q)  =  <<P.(r)|exp(tq 

■r)|<p,(r-R„)>. 

(2) 

Equation  (2)  shows  <bf,  to  be  the  Fourier  transform  of 
in  a  coordinate  system  with  origin  at  R,.  When 
(t>,  and  are  Slater-type  orbitals  (STO’s),  the  right- 
hand  side  of  Eq.  (2)  is  cumbersome  to  evaluate  but  can 
be  reduced  either  to  a  single  quadrature  or  to  an  infinite 
series  of  Bessel  functions.*  However,  for  q- 0,  of, 
assumes  a  simple  form;  it  is  then  the  overlap  integral 
{<0,  i  «>,). 

Expressions  parallel  to  Eqs.  (I)  and  (2)  arise  when 
Fourier  representation  methods  are  used  for  electronic- 
structure  calculations  on  crystalline  solids.'  As  an 
example,  consider  the  use  of  Bloch-wave  crystal 
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orbitals  Ik,},  defined  as 

!  k, )  =  exp(ik  •  r)£  <p,[r  -  R„),  (3) 

IS 

where  k  is  the  Bloch  wave  vector  and  the  R„  are  the 
lattice  vectors.  The  index  “a"  denotes  the  functional 
form  of  < This  definition  of  Ik,)  is  of  what  we  call 
“mudulated  plane  wave”  type,  and  shares  with  the  usual 
linear-combination-of-atomic-orbital  (LCAO)  or 
'‘tight-binding”  orbitals  the  calculational  features 
described  below.  In  terms  of  these  crystal  orbitals, 
the  electron  repulsion  integrals  of  interest  are 
{ k,k£  I  r I  k#kj  >  and  (k.k’,!  r'ti  kjk,);  the  first  of  these 
integrals  diverges  faster  than  the  sample  size  increases, 
but  the  divergence  is  offset  by  those  of  the  electron- 
nuclear  attraction  integrals  \k,l£14:r-Rur‘lks)  and  the 
nuclear— nuclear  repulsions. 

The  nondivergent  (i.  e.,  properly  extensive)  part  of 
\'k,k'e:r‘‘lk^ci)  may  be  reduced  to  the  form 

(kxir-jikx)  4*,r4(qJ+*(-qJ.  (4) 

where  -V  is  the  number  of  unit  cells  in  the  sample,  the 
qu  are  the  vectors  of  the  lattice  reciprocal  to  the  R„, 
and  r0  is  the  unit-cell  volume  of  the  Ru  lattice.  The 
prime  on  the  summation  sign  indicates  that  the  point 
q„  =0  is  to  be  omitted  from  the  sum.  The  <J>r  are  trans¬ 
forms  of  lattice  sums  of  atomic— orbital  products,  and 
are  therefore  linear  combinations  of  the  <f>r  appearing 
in  Cq.  (2): 

* f»( <l)  =?<<P.(r) I  expftq-  r)l  ^.(r-RJ).  (5) 

The  integrals  (kXlrJ‘1  kjk,)  and  the  nondivergent  part 
of  ;k.l^!r-RJ*llk,>  also  reduce  to  summations 
involving  *r.  The  overlap  integral  (k,lk,)  assumes  the 
simple  form 


<k,!k,)*N*f,(0). 


(6) 


III.  EVALUATION  OF  (q) 

We  now  consider  more  specifically  the  evaluation  of 
Eq.  (5)  when  <p,  and  <fit  are  normalized  Is  STO’s: 

<?.  =  (£i/«)l/Jexp(-£,r).  (7) 

Expressions  involving  STO’s  of  higher  quantum  numbers 
can  be  derived  by  analogy  or  by  differentiating  the 
results  given  here  with  respect  to  the  screening  param¬ 
eters  £,  and  £k.  Inserting  Eq.  (7)  into  Eq.  (5), 

“^(q)  =  [( £.£>)3/7*]Z/dr 

u 

x exp(-  £,r  -  £,  |  r  -  R,  |  +  iq  •  r).  (8) 

The  integral  on  the  right-hand  side  of  Eq.  (8)  is  that 
which  was  earlier  identified  as  cumbersome  to  evaluate. 
For  simplicity  we  assume  a  simple  cubic  lattice. 

The  most  straightforward  evaluation  of  is  obtained 
by  using  the  Fourier  convolution  theorem  to  write 

+£(q>  =  [1/(2* )3}I  / dp <tf(q  -  p)(?,(r  -  R„)]r(p).  (9) 


Introducing  the  expression  for  the  transform  of  the  Is 
STO, 

(q)  =  81T1  +  tj)1,  (10) 

and  noting  that 

[<?(r  -  R„)]r(p)  =  exp(iRu  •  p)<pr(p),  (11) 


we  have 


/dp 


_ 1  exp(iRu-p) 

( iq  -  pi  *"+  £j)*  (p4  + 


(12) 


Next,  we  interchange  the  order  of  summation  and 
integration  in  Eq.  (12),  reaching  thereby  a  lattice  sum 
satisfying7 

£exp(iRu  •p)  =  (8irVt'0)E6(p-P„),  (13) 

I*  I* 


Crystalline  solids  appear  to  be  far  more  economically 
described  when  the  atomic  orbitals  appearing  in  Eq.  (3) 
are  chosen  to  be  STO’s  rather  than  the  historially  more 
popular  Gaussian-type  orbitals.  Fortunately,  an  STO- 
based  formulation  is  practical,  as  the  quantities  of 
Eq.  (5)  can  be  reduced  to  readily  manipulable  forms. 
Although  the  individual  summands  of  Eq.  (5)  are  difficult 
to  calculate  [cf.  the  discussion  immediately  following 
Eq.  (2)],  the  equation  can  be  made  tractable  by  taking 
advantage  of  the  presence  of  the  lattice  sum.  A  number 
of  calculations  based  on  Eqs.  (4)  and  (5)  have  now  been 
reported. 

While  the  work  done  to  date  suffices  to  demonstrate 
the  practicality  of  Fourier  representation  methods  for 
solid-state  electronic  structure  studies,  it  has  also 
shown  that  the  evaluation  of  Eq.  (S)  consumes  the  bulk 
of  the  required  computational  effort,  and  that  the  root 
of  the  problem  is  the  slow  rate  of  convergence  of  the 
lattice  sum  Involved.  We  therefore  turn  our  attention 
to  methods  for  the  evaluation  of  Eq.  (5),  and  more 
specifically  to  the  Introduction  of  convergence  accelera¬ 
tion  techniques  of  the  kinds  already  found  to  be  useful 
in  Madelung  summations. 


where  the  pu  are  reciprocal-lattice  vectors.  Equation 
(13)  is  sometimes  referred  to  as  a  "lattice  orthogonality 
relation”,  and  is  a  special  case  of  a  Poisson  summation 
formula,  the  summands  on  its  right-hand  side  being 
Fourier  transforms  of  those  on  its  left.  When  the  right 
side  of  Eq.  (13)  is  substituted  into  Eq.  (12),  the  p  inte¬ 
gration  reduces  to  a  lattice  sum,  and  we  have  the  final 
result 


(14) 


Equations  (8)  and  (14)  may  be  regarded  as  the  two 
"standard”  ways  to  express  as  a  lattice  sum.  To 
simplify  further  discussion,  we  recapitulate  these 
equations  in  a  dimensionless  notation  in  which  the  sum¬ 
mations  are  over  a  unit  lattice  of  vectors  m,  and  write 
t’0  =  aJ,  5,=at,/2ir,  q  =  2irv/a: 


4’.7’»(*/) 


(8.».)3/V 


fdr 
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(21) 


*exp(-84r-S*|r-2irjj|  +  »»',r)  (15) 

8(M>)i/2  r _ 1 _  .... 

Leaving  aside  for  the  moment  the  very  real  differences 
in  ease  of  evaluation  of  the  summands  of  Eqs.  (15)  and 
(16),  we  note  that  the  summation  of  Eq.  (15)  will  con¬ 
verge  rapidly  when  5,  or  54  is  large,  but  only  slowly 
when  both  8,  and  8,  are  small.  We  note  also  that  the 
convergence  will  be  exponential  (as  that  of  a  three- 
dimensional  geometric  series).  On  the  other  hand,  Eq. 
(16)  will  converge  rapidly  when  5,  or  5S  is  small,  but 
will  become  inappropriate  when  5,  and  8>  are  both  large. 
The  summation  of  Eq.  (16)  approaches  for  large  4  that 
of  4"*,  with  the  disadvantages  identified  for  such  sums 
in  the  introduction. 

Remembering  now  that  the  summands  of  Eq.  (15)  are 
relatively  difficult  to  evaluate,  we  may  appreciate  the 
real  problem  in  the  evaluation  of  4>[t.  We  should  like  to 
have  at  least  part  of  the  convenience  of  Eq.  (16),  but 
in  a  context  improving  its  convergence,  particularly  for 
large  5,  and  5b.  We  cannot  really  afford  to  use  Eq.  (15) 
unless  8,  and  5„  are  so  large  that  extremely  few  terms 
are  required. 

IV.  ACCELERATION  OF  CONVERGENCE 

Following  the  general  idea  of  the  Ewald  method,  we 
introduce  integral  transforms  for  ip,  and  <pb  in  Eq.  (5), 
planning  to  divide  the  integrations  into  regions  each  of 
which  will  receive  optimal  further  processing.  The 
transform  we  have  found  most  suitable  is  that  suggested 
by  Kikuchi"  and  subsequently  used  by  Shavitt  and 
Karplus8: 

exp(-5r)=-^7J  f  dx  x'l/t  exp(-  x62  -  rJ/4x).  (17) 

v  J  a 

Insertion  of  Eq.  (17)  for  exp(-  8,r)  and  exp(-  8>lr  -  2irp  I ) 
in  Eq.  (15)  (the  dimensionless  equivalent  of  Eq.  (5)] 
yields 

<f>«(v)=^i-~j — dxj^  dy(xy)'l/1 
x  exp(-  52x  -  5jv)  Jdv  exp 


The  r  integration  may  now  be  carried  out,  leading  to 
the  result 


3(6,at)5/y 


XV 

av  - - iTT7 

'  (x  +  v)3/ 


Ar/  ,  3(6A)5/V  fm  ,  r*  .  XV 

rm  dxJ0  dy  Jx  +  y)"1?4' 

x  exp  (-  8jx  -  Sfv  -  ~p-  -  ~~  +l2i£iiU£\ .  ,19) 

V,  ^  *+y  *  +  >’  *  +  v  J 

We  next  notice  that  the  4  summation  will  converge  at 
a  rate  mainly  determined  by  the  magnitude  of  x  +  y. 

We  therefore  change  variables  from  x,  y  to  s,  t,  where 
s=x*y,  f=Cx-y)/(x-‘-y),  with  limits  0  <  ”,  -1«, 

si,  and  with  dx  dy  =  \sdsdt: 

dsj\ dt 

xexp(-ys  -  *ri(l  ■- ()>i  •  v),  (20) 


y  =  i85(l+,)  +  i6’(l-t)+Ml-ti). 

We  are  now  ready  to  divide  the  range  of  s  into  the 
two  intervals  (0,2)  and  (Z,  ”),  where  Z  is  arbitrary 
and  will  be  specified  later.  For  the  first  of  these  inter¬ 
vals  the  4  sum  will  be  strongly  convergent,  in  fact  con¬ 
verging  as  exp(-  .:2|iVZ).  For  the  remaining  interval, 
the  convergence  is  poor,  but  can  be  improved  by  use 
of  the  Poisson  summation  formula 

£exp(-i^42/s-(-xi;(l  +  ,)4  •  v) 

U 

=(f  J7  E  exp[-  s  (  4  -  |(1  +  t)v!*J.  (22) 

After  substitution  of  Eq.  (22)  for  (Z,  ”),  the  integrations 
in  both  s  and  t  may  be  carried  out,  leading  to  the  final 
result 


y,s3/2(i-,2) 


x  exp(-  ys  -  it 24*/s  -  +  t)  4  •  v] 

,  8(5.8t)i/2orexp(-oZ)/  2  1 

P  » I  a(a  — d)4  \a  -  d  a  J 

exp(-  (IZ)  /  2  1  \1 

Itjrjr  id  -  a  d  )\  ’ 


If  there  is  a  4  value  such  that  a  =  d,  the  square  bracket 
on  the  right-hand  side  of  Eq.  (23)  must  be  replaced  for 
that  4  value  by 

<»«■  »• 

We  note  that  the  portion  of  Eq.  (23)  corresponding  to  the 
interval  (Z,  ”)  now  exhibits  convergence  as  exp(-  d2Z). 

Equation  (23)  possesses  the  convergence  acceleration 
properties  we  seek.  The  value  of  Z  remains  to  be 
chosen;  it  determines  the  relative  importance  of  the  two 
4  summations.  In  the  limit  Z  — 0,  the  first  sum  vanish¬ 
es  while  the  second  approaches  that  of  Eq.  (16);  in  the 
limit  Z  —  «  only  the  first  sum  survives,  and  it  can  be 
shown  equivalent  to  Eq.  (15).  Intermediate  values  of  Z 
give  pairs  of  series  corresponding  roughly  to  Eqs.  (15) 
and  (16),  but  each  converging  more  rapidly  than  the 
limiting  series  to  which  it  corresponds.  Remembering 
that  the  first  summation  contains  summands  which  are 
harder  to  evaluate  than  those  of  the  second  sum,  we 
may  choose  Z  small  enough  that  very  few  terms  of  the 
first  summation  are  needed,  but  with  Z  sufficiently  far 
from  zero  that  we  have  effective  exponential  conver¬ 
gence  in  the  second  sum. 


V.  NUMERICAL  EXAMPLES 

For  illustrative  purposes  we  elect  to  aim  for  seven 
significant  figures  and  to  choose  Z  small  enough  that  in 
the  first  sum  of  Eq.  (23)  we  will  need  to  keep  only 
4=0  and  the  six  members  of  the  (1,  0,  0)  star 
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IM  =  (1,  0,  0),  (-  1,  0,  0),  (0, 1,  0),  (0,  -  1,  0),  (0,  0,  1), 

(0,  0,  - 1)].  The  leading  terms  omitted  from  the  first 
sum  will  have  n2  =  2,  and  can  be  expected  to  be  smaller 
than  the  fi  =0  term  by  approximately  the  factor 
exp(-  2ir2/Z),  ignoring  any  additional  convergence  which 
may  be  produced  by  large  values  of  5a  or  5S.  We  there¬ 
fore  tentatively  take  Z  =  1,  causing  the  six  omitted 
terms  with  n2  =  2  to  aggregate  to  about  10‘*  or  less  of 
the  leading  term.  Details  of  the  calculation  of  the  above- 
described  terms  are  given  in  an  appendix. 

In  evaluating  the  second  sum,  we  note  that  the 
&xp(-aZ)  portion  of  the  summand  is  maximal  when 
M=  v,  while  the  exp (-3Z)  portion  peaks  at  n=0.  We 
therefore  reorganize  the  sum  so  as  to  keep  for  each 
portion  a  set  of  terms  radiating  spherically  out  from  its 
respective  maximum.  With  Z  =  1  our  accuracy  require¬ 
ments  will  be  met  if  we  keep  d  values  within  spheres 
of  radius  V 1 8+  about  the  maxima.  There  are  20  stars 
of  u  values  within  such  a  sphere. 

For  the  purpose  of  presenting  numerical  results,  we 
write  Eq.  (23)  in  the  form 

*£(v)  =  SZ(0)  +  SZ(1,  0,  0 )  +  LZ,  (27) 

where  SZ( 0)  and  SZ(  1,  0,  0)  are  the  respective  contribu¬ 
tions  of  the  (i=0  and  (1,  0,  0)  stars  to  the  first  (small- 
Z)  summation,  and  LZ  is  the  contribution  of  the 
second  (large-Z)  summation.  Table  I  gives  the  results 
of  a  number  of  calculations  for  various  6„  6S,  and  v 
values,  both  for  Z  =  1  and  for  other  Z  values.  The 
accuracy  of  the  results  is  attested  to  by  their  substantial 
independence  of  Z;  the  effectiveness  of  the  formulation 
is  indicated  by  the  fact  that  some  of  the  quoted  results 
would  have  required  thousands  of  terms  if  evaluated 
without  a  convergence  acceleration  technique. 
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APPENDIX:  EVALUATION  OF  "SMALL-Z" 

SUMMANDS 

The  n=0  term,  denoted  SZ( 0)  in  Eq.  (27),  can  be 
completely  reduced  to  a  closed  expression  involving 
the  error  function.  It  is  convenient  to  introduce  the 
definitions 

2 

F{u)  =  u~i/2&ri(ul/,)  =  ~r=  /  exp(-uv2)dv,  (Al) 

V  / 

•'o 

G(u)  =  u'l/2e“  erf(u1/2)  =  4i-  /  exp {-uv2)dv.  (A2) 

»l  Jo 

In  the  material  to  follow,  G(u)  occurs  for  negative  u, 
and  therefore  involves  error  functions  of  imaginary 
argument.  We  have  prepared  a  computer  program  for 
rapid  and  accurate  evaluation  of  G(«)  with  negative 
u. 

The  actual  reduction  of  SZ( 0)  is  tedious,  but  can  be 
accomplished  with  a  judicious  choice  of  standard  pro¬ 
cedures.  The  results  can  be  expressed  in  terms  of  the 
auxiliary  quantities  a  =  jv(l  +  d),  3  =  iu(  1  -  d), 
c  =  i(6^6|)->2,  6  =  i(62  -  62),  c  =  i(6|*6 ’♦a’  +  d2), 
and  d  =  (6j  -  62)/v2.  The  final  formulas  depend  upon 
whether  v  or  b  vanish;  we  distinguish  three  cases  as 
set  forth  below; 

SZ(0)  =  [4(645t)s'2Z,/Vff1/Jcv3] 

xCU1' V2c){[aJ  +  d52]F(62Z)  +  [3 2  ~  dt>\)F(b\Z)\ 

+  exp(-  5 \Z){3  -  at [c  -  (4a,3/V)(l  +  Zc)]  G(-  a2Z)} 

+  exp(-  5 \Z)  {a  -  /3[c  -  (4a/3/V) 

x(l  +Zc)]G(-32Z)}  (v*0)  (A3) 

=  [(5a5>)5/,Z'/2/261] 


TABLE  I.  Lattice  sums  ♦Jfv)  of  Eas.  (23)  and  (27)  and  the  "sraall-Z”  and  “large-Z”  contributions  thereto  for  various  values  of 
the  convergence  acceleration  parameter  Z. 
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X  {(1/V  /JZ)(exp(-  6\Z)  -exp(-  5*2)] 

*  (a  -  6)/X5|Z)  -  (a  +  ft)Z(SjZ)}  (v  =  0,  ft  »  0)  (A4) 

=  l^'Va’HH^Z)  -  (2/jr1/2)(l  +  2aZ/3) 

xexp(-5jz)]  (t/  =  6  =  0).  (A5) 

The  other  contribution  we  require  is  that  when  n  is 
a  member  of  the  (1,  0,  0)  star;  this  contribution  was 
denoted  SZ(1,  0,  0).  Starting  from  Eq.  (23),  we  proceed 
by  expanding  exp(-ys)  in  a  Taylor  series  about  s  =  Z, 
after  which  the  s  and  t  integrations  separate  in  each 
term; 

SZU,  0,  0)  J  dty(l  -  t*)exp(- '/Z! 

ST  n»0  •'.I 

x{cos[ir(l  +  t)i/x]  +  cos[r(l  +  f)i>y] 
cos(ir(l  +  f)i>f]},  (A6) 

with 

A,(Z)^/>  s3/J(Z  -  s)"exp(-  rrVs).  (A7) 

The  quantities  A„(Z)  depend  only  upon  n  and  Z,  so  that 
once  Z  is  fixed  they  may  be  determined  once  and  for  all. 
For  Z=  1  they  decrease  rapidly  with  increasing  n:  for 
seven-significant-figure  results  they  are  not  needed 
beyond  n  =  4 . 


The  t  integration  could  be  reduced  analytically  to 
error  functions  of  complex  argument,  but  it  is  never 
needed  with  an  accuracy  of  more  than  about  four  signi¬ 
ficant  figures  and  is  easily  handled  by  numerical  inte¬ 
gration.  We  have  obtained  satisfactory  results  both 
from  the  use  of  Simpson’s  rule  and  from  Lobatto 
quadratures. 10 
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“Special  points  for  Brillouin-zone  integrations” — a  reply* 

James  D.  Pack  and  Hendrik  J.  Monkhorst 
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It  is  again  emphasized  that  for  ill  cubic  lattices  the  method  of  Monkhorst  and  Pack  generates  not  only 
point  sets  identical  to  those  obtained  by  Chadi  and  Cohen,  but  also  intermediate  sets  with  the  same 
properties.  In  addition,  a  comparison  of  these  two  methods  for  the  hexagonal  lattice  reveals  our  method  to  be 
more  flexible  and  efficient. 


In  reply  to  the  preceding  comments1  by  Chadi 
on  '“Special  points  for  Brillouin-zone  integrations," 
we  feel  the  following  remarks  should  be  presented. 
The  formalism  which  we  have  given3  is  completely 
general  and  can  be  applied  to  any  of  the  5  two- 
dimensional  Bravais  lattices  or  any  of  the  14  three- 
dimensional  Bravais  lattices.  However,  each  lattice 
is  unique,  and  the  method  must  be  tailored  to  fit  the 
symmetry  requirements  of  that  particular  lattice. 
This  consideration  will  result  in  a  reduction  in  the 
number  of  special  k  points  required  by  afactor  rough¬ 
ly  equal  to  the  order  of  the  point  group  at  a  lattice 
site  (at  least  it  will  approach  that  limit  as  q~<*>). 

The  special  treatment  required  for  the  body- 
centered-cubic  (bcc)  lattice  has  already  been  pre¬ 
sented  in  the  Appendix  of  our  previous  paper.2 
Again  we  emphasize  that  when  q  =  2"  the  bcc  point 
sets  generated  by  the  procedure  outlined  in  the 
Appendix  are  identical  to  those  given  by  Chadi  and 
Cohen2  (CC).  These  point  sets  are  more  efficient 
than  those  obtained  by  a  straightforward  application 
of  the  general  formalism  to  the  primitive  bcc  lat¬ 
tice  whose  translation  vectors  are  not  mutually 
perpendicular. 

To  efficiently  treat  the  hexagonal  Bravais  lattice 
we  agree  that  our  method  must  be  slightly  modi¬ 
fied.  We  suggest  the  following  alterations.  Let 
up  assume  the  values 

«,  =  (/>- l)/<7.  ,  =1,2,  3 . .  (1) 

Assign  the  same  values  to  ur,  while  retaining  the 
original  definition  for  u , 

u,-(2s-qc-\)/2qe,  s  *  1, 2,  3, . . . ,  qe .  (2) 

Using  the  primitive  translation  vectors  given  by 
CC  for  the  hexagonal  lattice,  these  special  k  points 
occupy  a  rhombus-prism  Brillouin  2one  <BZ),  but 
reciprocal-lattice  translations  can  bring  them  into 
a  hexagonal-prism  BZ.  The  Du  symmetry  reduces 
the  region  of  interest  to  an  irreducible  zone  (whose 
volume  is  j;  that  of  the  BZ)  defined  by 

0*2u,*ur*l,  Osu,su,si,  0«u,<i.  (3) 

To  have  orthogonal  functions  A, ( k)  the  following 


restrictions  must  be  imposed  on  the  components  of 
Rr 

0«2 R2sRlt  2Rl-R2nqt,  0*R3<jqe.  (4) 

Such  stars  will  satisfy  the  orthogonality  relation¬ 
ships 

^r-y*!wjAm(K)Aj  k,)  =  5M(  (5) 


where  the  weights,  wn  are  obtained  from  sym¬ 
metry  considerations,*  and 


P{q)=PM*)PMc)- 

(6) 

It  can  be  shown  that 

PM*  )  =(a  +  1)(3 a  +/3)  +  63o, 

(7) 

where 

£  =  9*  mod  6  ,  a  =  jig,  -  0) , 

(8) 

and 

PM,)-  *?‘iseven’ 

(9) 

U(<7c  +  1)  if  Qc  is  °dd- 

In  other  words,  as  q  assumes  values  1  through  10, 
P,(q)  assumes,  respectively,  the  values  1,2,  3,  4, 
5,  7,  8,  10,  12,  14;  while  Pc(q)  takes  on  the  values 
1, 1,  2,  2,  3,  3,  4,  4,  5,  5.  The  parameters  q„  and  qc 
are  chosen  to  best  suit  the  problem  and  need  not 
be  equal.  It  is  advantageous  to  restrict  qc  to  even 
values. 

Using  this  method  we  get  point  sets  somewhat 
similar  to  those  of  CC.  To  compare  the  two  meth¬ 
ods  it  is  helpful  to  consider  the  two-dimensional 
hexagonal  lattice.  Cunningham  has  applied  the 
method  of  CC  to  two  dimensions.5  Using  the  cri¬ 
teria  of  CC  that  a  point  set  is  more  efficient  if  it 
eliminates  more  nearest  shells  (rings  m  this  case) 
of  direct  lattice  points,  let  us  compare  the  two 
jnethods.  The  first  set  of  CC  has  three  special 
k  points  and  fails  at  the  fifth  nearest  ring.  Our 
method  yields  a  three-point  set  which  also  fails 
at  the  fifth  ring,  though  the  point  sets  are  differ¬ 
ent.  The  next  set  of  CC  has  six  points  and  fails 
at  the  12th  ring.  Our  method  does  not  give  a  set 
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with  six  points,  but  the  one  with  five  points  fails 
at  the  11th  ring  and  the  one  with  seven  points  fails 
at  the  15th  ring.  The  next  set  of  CC  has  18  points 
and  fails  at  the  31st  ring,  while  our  12-point  set 
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fails  at  that  same  31st  ring.  As  the  number  of 
points  increases  our  method  becomes  increasingly 
more  efficient,  and  also  gives  many  more  point 
sets  from  which  to  choose. 
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Analytic  connection  between  configuration-interaction  and 
coupled-cluster  solutionsa) 
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The  coupJed-vJusier  (CC)  equations  in  ihe  work  of  Coaster,  Kiimmel.  tiiek,  Paldus,  and  others  are 
inhomogeneous,  nonlinear  and  algebraic  in  the  cluster  operators  to  be  determined.  If  taken  to  all  orders, 
they  are  equivalent  to  complete  configuration-interaction  (Cl)  equations,  except  for  stales  orthogonal  to 
the  reference  state  <1>  However,  if  taken  only  to  n  th  order,  they  are  not  equivalent  to  the  n  th  order  Cl 
equations,  and  due  to  their  nonlinear  form,  the  existence  and  the  number  of  the  solutions  is  not 
guaranteed.  Also,  the  reality  of  the  associated  energy  values  is  not  certain  since  these  values  do  not  anse 
as  eigenvalues  of  a  Hermitian  operator.  We  show  that  the  equations  can  be  cast  in  the  form  of  perturbed 
Cl  equations,  with  the  "perturbations"  being  non-Hermitian  and  nonlinear  in  the  Cl-like  coeiTicients  to  be 
calculated.  In  the  case  of  a  finite  number  of  single-parttcle  states,  we  construct  the  solutions  to  the  CC 
equations  by  analytic  continuation  from  the  Cl  solutions.  Singularities  peculiar  to  the  method  are 
identified  and  studied,  jnd  conditions  for  reality  and  the  maximum  multiplicity  of  solutions  are  given.  In 
general,  the  energy  will  be  real,  and  the  number  of  solutions  equals  that  of  the  associated  Cl  problem. 
Singularities  or  instabilities  in  the  coupled-duster  equations  can  be  traced  to  unphysical  assumptions  in  the 
basis  set  Hamiltonian,  or  a  poor  description  to  highly  excited  stales. 


I.  INTRODUCTION 

In  their  landmark  publications  of  1957,  Goldstone1 
and  Hubbard2  proved  the  existence  of  the  now-famous 
linked-cluster  theorem  for  interacting  fermion  systems. 
The  the  rem,  in  effect,  states  that  the  perturbation  cor¬ 
rections  to  the  wavefunction  and  total  energy  beyond  the 
independent- particle  approximation  can  be  represented 
by  linked  Feynman  graphs.  Earlier  Brueckner3  had 
proved  this  to  hold  for  a  few  orders  in  the  interaction 
strength  by  explicit  computation.  Goldstone  and  Hubbard 
generalized  this  to  all  orders.  The  significance  of  the 
linked-cluster  theorem  stems,  in  part,  from  the  pro¬ 
portionality  of  the  energy  corrections  for  a  crystal, 
plasma  or  nuclear  matter  fermions  at  a  given  density 
with  the  number  of  particles.  It  also  provides  a  nice 
bookkeeping  device  for  the  myriad  of  perturbation  cor¬ 
rection  terms. 

Both  Goldstone’s  and  Hubbard's  proofs  use  the  in¬ 
teraction  representation,  thereby  introducing  time  de¬ 
pendence  in  an  intrinsically  time- independent  physical 
problem.  Through  the  “time”  integrations  (resulting 
from  the  adiabatic  switching  of  the  interaction),  the  en¬ 
ergy  denominators  of  perturbation  theory  appear,  and  a 
cancellation  of  unlinked  terms  in  both  the  exact  wave- 
function  and  correlation  energy  results.  Even  for  bound 
states  the  Feynman  diagrammatic  language  of  "forward" 
and  "backward''  scattering  of  particles  is  kept,  al¬ 
though  this  notion  has  only  physical  realism  in  scatter¬ 
ing  phenomena. 

Notwithstanding  its  rigor  and  elegance,  there  are  a 
number  of  disturbing  aspects  about  the  way  the  linked- 
cluster  theorem  was  introduced  and,  subsequently,  has 
been  interpreted  and  applied.  The  first  objection  con¬ 
cerns  the  introduction  of  time  dependence  in  the  deriva- 
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tion.  It  certainly  has  made  understanding  of  the  proof 
more  opaque  from  a  mathematical  standpoint,  and  even 
Goldstone  admits  that  in  his  paper.  Later  Brandow1 
made  the  same  point.  In  fact,  it  even  leaves  many 
authors  still  worrying  to  this  date  about  the  different 
types  of  cancellations  of  terms  that  we  seem  to  have  to 
distinguish. 5 

The  second  objection  is  the  emphasis  on  the  order- by¬ 
order  feature  of  the  theorem  and,  most  importantly,  its 
implementation.  Yet  it  is  well  known  that  for  Coulombic 
forces  in  extended  systems  perturbation  theory  diverges 
in  all  orders  except  the  first  one.  The  standard  cure 
is  partial  summations  of  the  most  divergent  terms  to 
all  orders,  thereby  eliminating  this  unphysical  singu¬ 
larity.  It  is  claimed  as  one  of  the  strengths  of  the 
linked-cluster  theorem  to  find,  through  topological 
arguments,  which  diagrams  should  be  summed  to  in¬ 
finite  order.  At  the  same  time,  in  most  cases,  one 
lacks  physical  insight  into  the  meaning  of  the  particular 
partial  summation.  An  infinite  number  of  infinite¬ 
valued  diagrams  is  always  ignored  thus  casting  doubt  on 
the  choice  of  those  that  are  kept. 

The  third  and  main  objection  is  the  diversion  of  the 
attention  away  from  the  physical  basis  of  the  linked- 
cluster  theorem  to  the  algebraic  and  particularly  topo¬ 
logical  meaning  of  linkedness  or  connectedness.  In  a 
sense,  it  is  a  question  of  representation  in  which  one 
prefers  to  describe  correlation  effects.  Its  choice  is 
inconsequential  only  when  calculations  can  be  carried  to 
completion  in  the  particular  representation.  Unfor¬ 
tunately  this  is  almost  never  the  case,  so  we  are  left 
with  the  choice  of  ihe  "most  efficient"  representation 
leading  to  the  "most  favorable"  convergence.  (Of 
course,  we  entirely  ignore  the  practical  problem  of  our 
limitation  to  handle  only  a  finite  number  of  excitations 
in  the  zeroth  order  spectrum.  But  this  applies  to  any 
representation.  ) 

In  a  many-fermion  configuration  space,  the  exact 
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wavefunction  is  “most  of  the  time"  well  described  by  an 
indcpendent-pnriicle  form.  The  probability  for  two  or 
more  particles  to  come  close  is  small,  and  diminishes 
rapidly  when  an  increasing  number  of  them  are  involved. 
This  is  even  attenuated  by  the  Pauli  exclusion  principle 
that  causes  all  fermions  to  carry  a  Fermi  hole,  even  in 
the  independent-particle  model.  Therefore,  the  energet¬ 
ic  effect  of  this  clustering  of  fermions,  described  by 
so-called  linked  terms,  should  decrease  rapidly  as 
their  size  increases.  There  is  a  significant  probability 
for  many  pairs  of  fermions  to  cluster  (or  correlate) 
with  antiparallel  spin  with  no  correlation  of  their  rela¬ 
tive  positions.  This  leads  to  so-called  unlinked  terms  in 
the  wavefunction,  expressible  as  the  product  of  linked 
terms.  Their  occurrence  is  overwhelmingly  important 
in  large  systems,  and  their  neglect  causes  serious 
errors. 

The  importance  of  the  linked-cluster  theorem  from  a 
wavefunction  point  of  view  is  its  recognition  of  the  fast 
convergence  of  the  energetic  effect  from  above  linked- 
cluster  type  terms  beyond  the  independent-particle 
approximation.  At  the  same  time,  it  includes  products 
of  these  linked  terms  to  higher  orders  in  a  most  con¬ 
venient  manner.  Goldstone  and  Hubbard  were  aware  of 
this  fact,  and  in  Hubbard’s  paper  it  is  brought  out  to 
some  extent.  However,  the  overall  emphasis  remained 
on  the  connection  with  the  quantum-electrodynamical 
methods  of  Feynman,  leading  to  the  order-by-order 
view  of  many- body  effects  and  scattering  pictures. 

Almost  simultaneously,  Coester8  proposed  a  radical¬ 
ly  different  approach  to  the  same  linked-cluster  theory 
that  does  not  suffer  from  any  of  the  above  disadvantages. 
It  is  time  independent;  it  is  algebraically  simple;  and  it 
makes  the  physically  significant  corrections  to  the  inde¬ 
pendent-particle  wavefunction  very  explicit.  The  crucial 
idea  is  to  express  the  exact  wavefunction  in  occupation 
number  formalism  as 

gr  =  e  T<&  =  (1  +  r  +  r2/2!  +  r3/3!  +•••)$,  (1) 

where  the  reference  state  is  defined  by 

+  =  n  hjvac),  t>a=<ra,  (2) 

Hal 

vac)  is  the  vacuum  state  and  ba  are  the  creation  opera¬ 
tors  for  the  fermions  in  the  reference  state.  We  denote 
with  a  the  occupied  states,  and  with  r  the  unoccupied 
states.  The  cluster  operator  T  is  defined  by 

r=£  T„  (3) 

Sm  1 

where 


t,  =  e 

ra 

TJ  =  S  rr'  ^ 

and  the  general  s-particle  cluster  operator  T  is  given 
by 


where  R  is  the  ordered  set  of  s  “particle"  indices 
irt ,  rt,  •  •  • },  and  is  that  for  the  s  "hole"  indices 
{<*ii  <*!,••• f.  Here  and  in  the  following  we  will  use  the 
notation 


We  sometimes  write  <!>*  when  no  confusion  will  arise. 
Ts  creates  s-particle-hole  pairs,  which  makes  Eq.  (1) 
a  nonunitary  transformation  of  4>.  Its  exponential  form 
keeps  track  of  the  counting  over  distinct  corrections  to 

only,  expressed  as  replacements  of  one-particle 
states  by  cluster  functions  and  their  products,  and 
represented  by  Ta  and  products  thereof.  The  power  of 
this  description  results  from  two  aspects.  First,  with¬ 
out  loss  of  generality,  we  can  assume  that  the  T ,  de¬ 
scribe  linked  clusters,  meaning  that  these  operators 
cannot  be  written  as  the  product  of  two  or  more  opera¬ 
tors.  This  assumption  is  consistent  with  the  equations 
that  determine  the  cluster  operators. 

The  second  advantage  is  that  we  can  perform  a  sim¬ 
ple  algebraic  trick.  When  the  Schrodinger  equation  is 
in  our  cluster  expansion  form 

H[eTb)  =  £(er4>)  (7) 

we  can  obtain  its  solution  from  the  equivalent 
expression8 

(e'THeT)$  =  £$.  (8) 

But  now  we  can  express  the  left-hand  side  of  Eq.  (8)  as 
a  finite  commutation  series, 

e'THeT  =  H  +  [H,  T]  +  l[[H,  T],  T] 

+  •  •  •  +  d/4 1 )(([[«,  r],  r],  r),  r],  o) 

The  series  terminates  after  five  terms  since  H  contains 
at  most  two  particle  operators.  Using  simple  commuta¬ 
tor  algebra7  one  can  reduce  those  expressions  even 
further.  In  order  to  obtain  equations  for  £  and  T,  one 
premultiplies  Eq.  (8)  with  (*i  and  and  gets: 


(*i  e-THeT\*)  =  E,  r=£r„ 

Jal 

<$f(s)|e'Ttfer|*>  =  0  (s  =  1 , 2 . ;t). 


Each  wavefunction  nonorthogonal  to  <t>  can  be  ex¬ 
pressed  in  the  form  (1).  However,  usually  one  includes 
only  up  to  n- particle  cluster  operators  which  leads  to 
an  approximation  of  the  exact  wavefunction.  The  cor¬ 
responding  equations  are  of  the  order  *i,  as  indicated  in 
Eqs,  (10)  and  (11).  Moreover,  we  assume  that  the  num¬ 
ber  of  single-particle  states  is  finite.  This  implies  that 
both  the  number  of  particles  K  and  the  number  of  holes 
■V  is  finite.  Solutions  of  Eqs.  (10)  and  (11)  are  hence  ap¬ 
proximate  solutions  of  the  Schrodinger  equation  for  K 
particles  in  a  finite  dimensional  Hilbert  space.  Obvious¬ 
ly  n  s  .V.  However,  the  case  n-S'  is  of  no  practical  in¬ 
terest.  As  shown  in  the  Appendix,  provided  n  -  .V,  Eqs. 
(10)  and  (11)  are  equivalent  to  the  corresponding  finite 
dimensional  configuration- interaction  iCI)  equations  for 
all  states  nonorthogonal  to  Also,  as  n  increases,  the 
number  of  excitations  increases  so  rapidly  that  it  soon 
becomes  impractical  to  solve  these  equations.  Hence  we 
assume  n  <  ,v. 

In  actual  calculations,  Hartree— Fock  or  Brueckner 
orbitals  for  O'  and  r  states  were  used  and  a  very  limited 
number  of  nonzero  Ta  were  kept.  Obviously  as  many 
Eqs.  (11)  are  needed  as  nonzero  /J  amplitudes.  It  was 
found  that  in  ail  electron  systems  considered,  the  pair 
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approximation  T  -  T,  lends  to  about  99'7  of  the  .itt. una¬ 
ble  correlation  energy.  ’  Tilts  is  a  very  reassuring  re¬ 
sult  which,  as  indicated  above,  can  be  well  understood 
on  statistical  grounds. 

In  a  long  series  of  publications  since  1960,  Coester, 
Kiinimel  e/  til . s  have  used  this  method  (named  by  them 
the  exp- ^  method)  for  nuclear  ground-state  energies 
and  properties.  Cizek  and  Paldus1110  were  the  first  to 
apply  the  method  to  a  few  atoms  and  molecules  in  cal¬ 
culations  of  ground- state  correlation  energies.  Be¬ 
cause  of  their  pair  approximations,  the  latter  authors 
adopted  the  terms  coupled-pair  many  electric  theory 
(CPMET)  and  extended  CPMET.  To  stress  the  general 
cluster  aspects  of  the  method,  and  the  significant 
coupling  between  the  clusters  which  results  in  nonlinear 
terms  in  Eqs.  (11),  we  propose  the  name  coupled- 
cluster  (CC)  method. 

As  expected,  it  turns  out  that  by  iteration  of  the 
algebraic  equations  (10)  and  (11)  one  generates  all 
Goldstone-type  linked  perturbation  terms,  provided 
linked  clusters  to  all  orders  are  included.  The  marvel 
of  this  method  seems  to  be  that  it  zeros  in  directly  to 
the  heart  of  the  correlation  corrections  in  a  most  com¬ 
pact,  yet  transparent  manner.  This  applies  to  fermion 
systems  with  any  kind  of  interaction. 


see,  these  are  more  of  numerical  than  physical 
significance. 

In  the  next  section  we  will  present  the  derivation  of 
the  Cl- like  representation  of  the  CC  method  followed  by 
a  section  with  numerical  examples  to  illustrate  some 
analytic  properties  of  the  formulation.  In  Sec.  IV  a 
summary  is  given  of  the  mathematical  analysis  of  the 
general  CC  equations,  the  details  of  which  are  given  in 
the  Appendix.  We  close  wi'h  a  section  in  which  we  point 
out  the  significance  of  the  findings  for  the  CC  equations, 
and  for  the  problem  of  solutions  to  coupled,  nonlinear 
algebraic  equations  in  general.  As  a  byproduct,  we  ob¬ 
tain  the  number  of  Hartree— Fock  solutions  in  a  particu¬ 
lar  atomic  orbital  basis. 

II.  GENERAL 

The  most  general  state  +  in  cluster  expansion  form 
nonorthogonal  to  <l>  can  be  written  as 


*  =e  r<t>  =  (1  +  C)4>, 

(12) 

where  T  is  given  by  Eqs.  (3)— (5),  and 
c  =  £  C„ 

s 

(13) 

=  ]C  ^ •  •  (l*,*la* 

RA 

(14) 

Even  since  its  inception,  Coester  and  Kiimmel’s 
approach  has  been  largely  ignored  by  the  many- body 
theoretical  community  for  a  number  of  reasons.  From 
a  formal  standpoint,  it  seemed  to  address  itself  only  to 
calculating  the  ground-state  correlation  effects.  These 
were  not  considered  as  very  interesting  quantities  com¬ 
pared  to  response  phenomena  of  many-body  systems, 
which  quite  naturally  shifted  the  attention  to  Green’s 
function  methods.  Diagrammatic  perturbation  methods 
are  used  to  compute  these  functions  in  a  more  or  less 
systematic  way.  In  a  recent  paper, 11  we  have  shown  that 
one  can  indeed  use  the  CC  method  to  compute  prop¬ 
erties,  both  time  dependent  and  time  independent,  by 
casting  it  in  Green’s  functionlike  form.  Yet  we  can 
preserve  the  algebraic  and  conceptual  simplicity,  and 
we  get  easily  solvable  equations. 

A  serious  objection  was  also  the  relative  messiness 
of  the  coupled  algebraic  equations  (11).  These  are  non¬ 
linear,  inhomogeneous,  and,  if  not  carefully  handled, 
of  considerable  complexity.  Moreover,  they  are  not  in 
some  eigenvalue  form,  thus  not  guaranteeing  the  real¬ 
ity  and  a  definite  multiplicity  of  their  solutions. 

Coester12  made  some  attempts  to  answer  these 
questions. 

We  have  now  eliminated  this  objection  by  casting  the 
CC  method  in  the  form  of  a  perturbed  configuration- 
interaction  (Cl)  method.  The  perturbation  is  a  non- 
Hermitian,  unbounded  operator  with  polynomial  depen¬ 
dence  on  the  Ct-like  coefficients  to  be  determined. 13 
Algebraically  it  is  somewhat  like  the  Hartree— Fock 
method,  except  that  there  the  Coulomb  and  exchange 
operator  are  Hermitian  and  bounded.  With  the  help  of 
the  analytic  continuation  method  we  will  show  a  one- 
to-one  correspondence  of  the  CC  solutions  with  Cl  solu¬ 
tions,  and  reality  of  E  values  for  real  Hamiltonians. 
Moreover,  many  mathematical  properties  such  as  the 
Appearance  of  singularities  will  be  derived.  As  we  will 


Here  R  and  A  are,  as  before,  the  ordered  sets  of  s 
“particle”  and  “hole”  indices.  In  order  to  cast  the  CC 
equations  in  a  form  more  accessible  to  mathematical 
analysis,  we  make  use  of  the  formal  identities 

eT  -\  +  C ,  (15) 

r  =  ln(l+C),  (16) 


expressing  Tt  in  Cl-like  operators  C,  and  vice  versa. 
According  to  Theorem  1  in  the  Appendix,  the  CC  equa¬ 
tions  (11),  including  up  to  n-tuple  excitations,  can  be 
written  as 


Vl’ifs)  I  (H  -  £){l  +  C,  +  •  ••  +  C, 

+  [c„,(/t)  +  e„j(,,)i}j>i>>=0,  s  =  o,  l ,  2, 


(17) 


where 

C„.,(«)=i;  ~r  r  5(s,  s„  n  +  1)  n  c 

*•2  *  ls{ /  «-i 

C,.2<"'  =  Z)  S  5(S,  +•  •  ■  +S„»!  +  1) 


Mm2 


(18) 


c, 

i.i 


Mm2  k  IJj) 


■  s„  »  *  21 


and  ^B(1  is  the  summation  over  all  possible  choices  of 
S],  s.,  •  •  •  such  that  their  sum  is  in  accord  with  the  5 
function.  Instead  of  the  CC  Eqs.  (17)  we  observe  the 
following  somewhat  more  general  equations. 


iH  + 

aV(*)1*  =  £*, 

(19) 

where 

H  R  A, 

CO) 

*  K-i. 

*•4'  =  '  *4®  KM',  (10 

=  'v4>Ji»l<-\.1(«)*C,.;(-iir;+>ft)rv,00, 

(21) 

=<+J|i  c„». 

(22) 
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It  can  easily  be  shown  that  in  the  case  of  A  =  1  Eq.  (19) 
reduces  to  Eq.  (17).  The  above  equation  is,  however, 
written  in  the  matrix  form,  where  H  is  a  Hermitian 
matrix  with  matrix  elements  given  by  Eq.  (20).  On  the 
other  hand,  V  is  basically  a  non-Hermitian  matrix  with 
matrix  elements  depending  on  the  components  of 
the  column  vector  ♦.  As  pointed  out  in  the  Introduction, 
we  assume  that  the  number  of  single-particle  states  is 
finite. 

Equation  (19)  where  A  is  a  parameter  will  be  called 
the  characteristic  equation  (CE).  For  the  case  A  =  0, 
this  equation  reduces  to  the  usual  Cl  equation  of  order 
n  with  the  subsidiary  condition  4»0<)  =  <4*  i  <fr)  =1.  In  this 
case  Eq.  (19)  is  a  Hermitian  equation,  eigenvalues  £ 
are  real,  and  the  number  of  independent  solutions  41  is 
less  than  or  equal  to  the  dimension  of  the  space  in¬ 
volved.  Our  idea  is  to  use  this  well-known  property  of 
Eq.  (19)  in  the  case  A  =0,  and  by  analytic  continuation 
to  extend  the  corresponding  solutions  to  the  whole  com¬ 
plex  A  plane.  The  particular  case  in  point  is  A  =  1  where 
the  solution  of  the  nth  order  CC  Eqs.  (17)  should  come. 
Thus,  by  that  process,  from  the  existence  and  reality 
of  the  solutions  of  Cl  equations,  we  hope  to  infer  the 
existence  and  reality  of  the  solutions  to  the  CC  equa¬ 
tions.  If,  however,  this  fails,  we  are  at  least  in  the 
position  to  trace  down  the  reason  why  it  is  so. 

III.  EXAMPLE 

In  order  to  get  a  clearer  insight  into  the  structure  of 
the  CE  and  the  different  possibilities  which  can  emerge, 
we  shall  give  an  example  of  a  CE  which  is  simple  en¬ 
ough  to  be  solved  analytically,  yet  incorporates  all  the 
characteristic  features. 

We  consider  a  system  consisting  of  two  particles 
which  can  be  distributed  among  four  different  states. 

In  this  case,  there  are  six  two-particle  states;  one  non- 
excited,  four  singly  excited,  and  one  double  excited 
state  (see  Fig.  1).  If  only  single  excitations  are  ex¬ 
plicitly  taken  into  account,  the  CE  for  the  above  system 
reads: 

v+J(s)  |  (H  -  £){l  +  C,  +  A(C,(1)  +  Cj(l)]}  |  *>  =  0, 

Cj(i)  =  jc?,  c,(i)»frcf,  s-0,1,  (23) 

which  can  be  easily  deduced  from  Eqs.  (17)  and  (18). 

The  single  excitation  operator  C4  can  be  written  in  the 
form 


FIG.  1.  Schematic  representation  of  the  ground  state  i  1  >  a  * 
and  excited  states  I  2  >  through  I  6?  constructed  from  four 
one-part tete  states  (denoted  by  1  to  4),  as  used  in  examples  of 
Sec.  in.  I  2  >  to  I  5  >  are  monoexcited  states,  and  I S  >  is  a 
doubly  excited  state. 


C,  -X2b;<l I  +  Xjfcjrt;  *  v4V,  * 
where 


v  i  =  S' ! c  i !  ■!■) . 


<24) 


This  gives 

CIU)|*>=t*j*s  +  *,*,)|6>,  C 3 ( X )  |<fr)  =0.  (25) 

We  can  now  insert  Eq,  (25)  into  Eq.  (23)  and  write  the 
resulting  equation  in  a  matrix  form, 


ij  =  l . 6.  (26) 

Equation  (26)  incorporates  some  correlation  effects 
since  two  particles  are  involved.  On  the  other  hand,  on¬ 
ly  single  excitations  are  explicitly  taken  into  account, 
and  hence  the  corresponding  CC  method  is  not  equiva¬ 
lent  to  the  Cl  method,  which  is  known  to  have  six 
linearly  indepenoent  solutions  with  real  eigenvalues. 
Thus  Eq.  (26)  is  the  simplest  nontrivial  example  of  the 
CE.  This  model  example  will  prove  quite  useful  in  the 
demonstration  of  the  different  properties  of  the  CE. 


So  far,  matrix  elements  htl  are  restricted  only  by  the 
hermiticity  condition  hu  =  hjt.  We  shall  now  solve  Eq. 
(26)  for  some  special  choices  of  those  elements. 


Our  first  example  is 


fA(xoc5  +  x,x4) 


1  1 

X  2 
*3 

L*5 


(27) 


We  assume  u,  v,  and  q  to  be  all  nonzero  and  obviously 
u’=u.  Matrix  equation  (27)  leads  to  a  set  of  five  equa¬ 
tions  in  five  unknowns; 


ii  +  t;xj  +  i>x4  =  E, 

0  =  Ex j, 

v*  +  \q(x?ti  +  xjt{)  =  Exz,  (27’) 

v*  +  A +  XjX4)  =  £x4l 
0  =  Exf, 

If  we  put  £  =  0  in  Eq.  (27')  we  obtain  solutions  satisfying 
x3(»  +  3)  -  »'*•*»  =  ft'* /a q, 

x4  =  -  (14/11  +  x3),  £  =  0.  u,8) 

For  each  A*0  those  solutions  form  a  three-dimensional 
variety  in  the  four-dimensional  space  spanned  bv 
Vj,  xj,  x4,  and  v,.  We  can  arbitrarily  choose  three 
eigenfunctions  satisfying  Eq.  (28),  and  one  possibility 
is 


1O1O 


(010 
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,  *2= 


(32) 


1 

0 

(-  u  +  D)/2v  , 
(-  u  -  D)/2v 


0 


r  1 

l/x 

4-,=  (-«  +  £»)/ 2u  , 
(-  u  -D)/2v 
0 


*  =  [(-  n'/xq)]172,  Z)  =  ((u2  +  4o2v*/x)]1/2. 

We  note  that  the  space  of  the  eigenfunctions  (28)  is 
not  linear.  If  4»(  and  4^  are  eigenvectors  of  Eq.  (27) 
with  the  corresponding  eigenvalue  E  —  0,  their  linear 
combination  4>  =  +■  (34',  such  that  (4<  1 4>)  =  1  (i.  e. ,  a  +  9 

=  1)  is  generally  not  an  eigenfunction  to  Eq.  (27).  This 
is  the  consequence  of  the  nonlinear  character  of  the 
equations  (27),  namely  of  the  dependence  of  the  matrix 
V  on  the  eigenfunction  4>.  We  can  enlarge  the  notion  of 
degenerate  states  to  the  general  case  that  more  than  one 
eigenvector  4<  corresponds  to  a  given  eigenvalue  E  with¬ 
out  those  eigenvectors  necessarily  forming  a  linear 
space.  In  this  sense,  the  solutions  (28)  are  triply  de¬ 
generate,  forming  a  three-dimensional  hype '■surface  in 
the  four-dimensional  space  spanned  by  xj  to  x5.  As  X 
tends  to  zero,  at  least  one  of  the  components  x ,  =  (4<  li> 
tends  to  Infinity.  Since  (<'!$)  *(4<  II)  =  1,  this  means 
that  4>  tends  to  be  orthogonal  to  the  nonexcited  state 
In  particular  if  in  our  case  we  take  a  limit  of  *i.  *2, 
and  4*3  as  X  tends  to  zero  we  obtain,  up  to  the  normali¬ 
zation  constant: 


Those  three  vectors  are  orthogonal  to  the  vector  <t  and 
though  they  are  not  eigenvectors  of  the  CE,  they  are 
eigenvectors  of  the  corresponding  CL  This  behavior  of 
the  above  solutions  of  Eq.  (27)  reflects  a  general 
property  of  the  solutions  of  Cl.  Since  <4>  l«t)  =  1,  no 
state  orthogonal  to  4  can  be  an  eigenfunction  of  CE. 

Besides  degenerate  eigenfunctions  (28)  corresponding 
to  the  eigenvalue  E  =  0,  Eq.  (27)  has  in  addition  two  non¬ 
degenerate  eigenfunctions  4>4  and  4<s'. 


(31) 

E4(X)  =  {u(u  -  Xq)  +  v([u:  +  4v'(2v  -  Ai?)|)1  n}/(2v  -  \q), 

£5(X)  =  {«(r-  X/7 )-  t/(|y +  4v*(2u  -  Xq)])1 7 *>/ (2u  -  \q), 

where 

x(X)  =  {-  u  +  ([u2  +  4v’(2v  -  Xq)])l/2}/{4v-  2Xq), 

x '  'x)  =  {-  it  -  ([«!  +  4v*(2t>  -  X4)])l/J}/(4i’  -  2Xq). 

It  should  be  observed  that  those  two  eigenfunctions 
represent  one  and  the  same  analytic  function  in  the  com¬ 
plex  x  plane,  and  that  components  of  4»,  and  4fs,  as  well 
as  £,  and  £s,  lie  on  two  different  Riemann  sheets  of  this 
function.  There  are  two  singular  points  of  which  one  is 


a  pole  (for  4'5)  and  the  other  is  a  branch  point, 

Kf  =  2v/q,  X>  =  (/<2  +  9rv’)/4v'q. 

In  the  case  of  X  =  0,  which  represents  Cl,  vectors  4<4 
and  4‘s  reduce  to 


x  =[-  u  +  (it2  +  8UV*)1  /2]/4o, 
Et-[uJr  (u2  +  Bvv*)1  /2]/2, 


~1~ 

0  x‘  =  [- u  -  (ir  +  8vv’)t/2]/4v, 
4.5(°)=  x'  , 

x'  £s  =  [«  -  (it2  +  8vv’)1/2]/2. 

J)  J 


(33) 


If  we  start  with  the  vector  4',(X)  for  X  =  0  and  move 
along  the  line  which  encircles  the  branch  point  Xt  and 
then  come  back  to  the  point  X  =  0,  we  arrive  at  the  vec¬ 
tor  4's(0)  as  indicated  on  Fig.  2.  4>4(0)  and  4>s(0)  are 
mutually  orthogonal  and  their  associated  eigenvalues 
are  real,  which  is  not  generally  true  for  4'4(X)  and  4,S(X). 
Obviously,  4'4(x)  and  4’S(X)  cannot  always  remain 
orthogonal,  since  they  are  analytic  continuations  to  each 
other,  and  in  the  branch  point,  x>t  they  coincide.  Con¬ 
cerning  reality  of  the  corresponding  eigenvalues,  in  the 
case  of  the  real  Hamiltonian  (i.  e. ,  v  and  q  real),  en¬ 
ergy  as  a  function  of  (real)  X  is  real,  as  long  as  we  do 
not  pass  a  branch  point.  Once  X>  \b  (for  the  case  X4>  0, 
and  X  <  X,  if  X*  <  0),  energy  £(X)  starts  to  be  complex 
due  to  the  negative  number  under  the  square  root  in 
Eq.  (31).  If,  however,  H  is  not  real,  energy  EM  is 
generally  complex  for  each  X*0.  Hence  the  reality  and 
non  reality  of  the  eigenvalues  of  the  CC  equations  de¬ 
pends  on  the  basis  in  which  matrix  elements  are  writ¬ 
ten.  If  these  matrix  elements  are  complex,  the  eigen¬ 
value  is  generally  complex.  If,  however,  these  ele¬ 
ments  are  real,  the  eigenvalue  is  real  at  least  in  some 
neighborhood  of  the  point  X  =  0.  This  is  to  be  compared 
with  the  Cl  method  which  always  gives  real  eigen¬ 
values  for  any  choice  of  the  basis  vectors.  We  see  that 
the  most  sensible  basis  choice  for  CC  equations  is  a 
real  basis  where  there  is  much  more  reason  to  believe 
that  the  corresponding  eigenvalues  will  be  real.  As  is 
well  known,  in  the  case  of  velocity- independent  Hamil- 


'1/  (  X ) 
5 


FIG.  3.  Singular  points  in  the  complex  X  plane  and  a  path  ana¬ 
lytically  connecting  the  solutions  +,1X1  and  +  IX)  of  the  ex¬ 
amples  of  Sec.  ill.  4>.(X)  and  4’,(X)  belong  to  different  nicniann 
sheets,  and  coincide  in  the  branchpoint  \s.  X,  and  are  a  pole 
and  hidden  singularity,  respectively. 
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tomans,  such  a  basis  always  exists.  It  should  be  noted, 
however,  that  even  in  the  case  when  H  is  complex,  the 
eigenvalues  arc  not  completely  without  meaning:  The 
imaginary'  part  of  the  energy  is  a  small  quantity  of  first 
order  tn  >  and  as  long  as  X  is  small,  the  predominant 
contribution  to  £(»}  is  given  by  its  real  part,  and  the 
imaginary  part  can  be  considered  as  an  error  since, 
in  any  case,  the  CC  method  is  only  an  approximation. 


One  can  easily  see  that  if  H  is  real  and  if  X,  <  0  the 
CC  equations  will  always  have  real  eigenvalues.  II, 
however,  X,''  0,  the  reality  will  be  insured  by  the  re¬ 
quirement  XjS  1,  l.  e. , 

(«!  +  8t’!)/4v<f  *  1.  (34) 


This  holds  particularly  true  if  q  is  small  with  respect 
to  r  or,  from  Eq.  (27),  if  the  mutual  interaction  of  the 
states  which  are  explicitly  taken  into  account  is  big  with 
respect  to  the  interaction  of  those  states  with  the  states 
which  are  taken  into  account  implicitly  through  the 
matrix  V.  That  is,  however,  in  accord  with  our  intui¬ 
tive  feeling  of  the  validity  of  the  CC  method. 

Our  next  example  is  the  CE, 


u 

0 

0 

0 

0 


0  0\  /0  0 

0  0  \  /o  0 

0  0  ]+ X(xjXs +Xjx4)  2b  0 

0  0/  \  v  0 

00/  \o  0 


0  0  0\ 
000 
0  0  0 
0  0  0  / 
0  0  0/ 


n 


*2 

xi  , 
*< 


‘5-1 


(35) 


where  «  and  v  are  supposed  to  be  different  from  zero. 
Equation  (35)  is  equivalent  to  a  set  of  equations: 

«*,  =  £, 

0  =  £x,, 

u*  +  2Xb(x;X5  +  XjX4)  =  Exj,  (35-) 

\v(xixi+xixi)  =  Exu 
0  =  Exs. 


Concerning  (35')  it  can  be  shown  that  there  are  two 
possibilities:  Either  X * u/v  or  \  =  u/v.  If  \*u/v,  there 
are  two  solutions  to  (35'):  X*«/b, 


“  1  - 

1  - 

0 

0 

(u'/u)'n 

,  *2  = 

-  (i i’/ii)1  /: 

0 

0 

.  0  _ 

0  - 

£,  =  (uu’)'nt 


£,  =  -  (uu*)1/J. 


(36) 

If,  however,  \=u/v ,  there  is  an  infinite  number  of 
solutions  to  (35'), 


4'(X  =»</(•) 


-  1  * 
0 

E/u 

x 

-  0  - 


£  =  UX  ±  (lt2x2  +  MM*)I/2. 


(37) 


Note  that  each  energy  £  can  be  an  eigenvalue,  except 
for  £  =  0.  Later  we  shall  sec  that  it  is  a  general  prop¬ 
erty  of  the  CE  that  sometimes,  for  particular  values  of 
X,  an  infinite  number  of  eigenvalues  are  possible.  We 
shall  call  such  a  point  in  a  complex  x  plane  a  resonance 
point. 

The  two  examples  above  illustrate  different  types  of 
anomalies  which  can  be  expected  for  the  CE.  We  shall 
now  examine  the  CE  in  more  detail  in  order  to  show 
that  the  above  two  examples  exhaust  all  but  one  of  the 
anomalies  which  can  be  expected. 

IV.  GENERAL  STRUCTURE  OF  CE 

In  the  previous  section  we  gave  some  examples  of  the 
CE  in  order  to  illustrate  possible  types  of  solutions. 

Now  we  are  going  to  observe  the  CE  from  a  general 
point  of  view. 

Observe  the  CE 

[H  +  xVCx)ix  =  £x,  x,  *1,  (38) 

where  x  =  {x(f . . .  .x,,}  stands  for  a  vector  4-  whose  com¬ 
ponents  4-Sa  are  renumerated  in  the  order  1, 2, ....  in 
(see  the  Appendix).  The  two  basic  questions  we  want  to 
answer  are  the  existence  of  solution  (£,x)  and  the  real¬ 
ity  of  the  eigenvalue  £, 

In  order  to  reach  these  answers  in  the  most  complete 
sense,  we  must  carefully  analyze  the  mathematical 
structure  of  the  CE.  In  particular,  we  have  to  identify 
singularities  in  the  complex  X  plane.  In  this  section, 
we  summarize  the  results  of  this  analysis.  Most  of  the 
details  are  given  in  the  Appendix,  particularly  the  proof 
of  important  theorems.  We  proceed  by  first  discussing 
the  special  x  points  for  which  the  CE  is  singular  in 
character,  followed  by  rigorous  answers  to  the  above 
questions  of  existence,  reality,  and  multiplicity  of 
solutions  (£,x). 

fa)  Resonance  points 

As  illustrated  by  Example  2,  the  CE  may  have  some 
points  A  for  which  it  has  an  infinite  number  of  distinct 
eigenvalues.  We  called  such  a  point  a  resonance  point. 

It  is  shown  in  the  Appendix  that  if  xr  is  a  resonance 
point,  each  £,  except  for  a  finite  number  of  them,  is  an 
eigenvalue  of  the  Eq.  (38)  where  x  =  x„.  Thus,  between 
resonance  and  nonresonance  points  there  is  a  complete 
symmetry.  In  a  nonresonance  point  there  is  a  finite 
number  of  eigenvalues  £;  in  a  resonance  point  there  is 
a  finite  number  of  £  which  are  not  eigenvalues. 

Another  question  concerns  the  distribution  of  reso¬ 
nance  points  in  a  X  plane.  This  question  is  answered  by 
Theorem  4  to  the  effect  that  there  is  either  a  finite 
number  of  resonance  points,  or  there  is  a  finite  number 
that  are  not  resonance  points.  If  the  first  case  occurs, 
we  call  the  CE  normal.  However,  the  possibility  that 
the  CE  is  not  normal  is  very  unlikely.  It  would  mean 
that  for  almost  every  x  Eq.  (38)  would  have  each  £, 
except  a  finite  number  of  them,  as  an  eigenvalue.  Phvs- 
cally  it  would  imply  that  the  CC  equations,  which  cor¬ 
respond  to  the  point  X  —  1 ,  are  void  of  any  significance, 
since  even  if  x  =  1  were  not  a  resonance  point,  each  I 

point  infinitesimally  close  to  it  wouid  be.  However,  we 
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were  not  able  to  rule  out  this  possibility,  although  we 
believe  that  it  actually  does  not  take  place.  In  any  event, 
if  the  CC  equations  are  to  be  given  any  physical  sig¬ 
nificance,  the  corresponding  CE  should  have,  at  most, 
a  finite  number  of  resonance  points  and  hence  it  should 
be  normal. 

The  solutions  which  are  "specific”  for  a  resonance 
point  we  call  resonance  solutions.  Besides  resonance 
solutions  in  a  resonance  point,  we  can  also  have  non¬ 
resonance  solutions  — the  distinction  being  that  reso¬ 
nance  solutions  cannot  be  extended  in  the  region  outside 
the  resonance  point.  Obviously,  if  each  resonance  solu¬ 
tion  could  be  extended  continuously  to  some  neighbor¬ 
hood  .V(Ar)  of  a  point  \r,  then  in  each  point  of  .V(Ar)  there 
would  be  an  infinite  number  of  distinct  eigenvalues,  and 
hence  this  point  should  be  resonance  as  well.  But  this 
would  imply  that  each  A  except  a  finite  number  of  them, 
is  resonance,  which  would  mean  that  the  CE  is  not  nor¬ 
mal.  Hence,  in  the  case  of  a  normal  CE,  only  a  finite 
number  of  the  solutions  in  a  resonance  point  Ar  can  be 
extended  continuously  outside  this  point  and  those  solu¬ 
tions  we  call  nonresonance.  Thus  resonance  solutions 
do  exist  only  in  a  resonance  point  and  cannot  be  ex¬ 
tended  outside  it.  They  are  isolated,  and  since  the  cor¬ 
responding  eigenvalue  £  can  assume  almost  any  value, 
be  it  real  or  complex,  they  are  void  of  any  physical 
meaning. 

Note  that  the  point  A  =  0,  by  the  very  definition,  can¬ 
not  be  a  resonance  point,  and  hence  in  the  case  of  nor¬ 
mal  CE  there  is  a  small  neighborhood  .V(0)  of  the  point 
A  =  0  where  there  is  no  resonance  point.  This  is  in 
accord  with  the  intuitive  idea  that  for  small  A  the  term 
xV(x)  should  be  considered  as  a  perturbation,  and  hence 
should  not  destroy  the  “good"  feature  of  the  CE  in  the 
point  A  =  0.  One  must,  however,  be  careful  in  such  con¬ 
clusions  since,  although  V(x)  is  continuous,  it  is  not  a 
bounded  operator  as  shown  in  Lemma  7  of  the  Appendix. 
Hence  there  is  no  a  priori  reason  why  aV  for  infinitesi¬ 
mal  A  should  be  infinitesimal. 

lb)  Singular  points 

CE  (38),  for  a  given  (E,x)  and  a  given  A,  is  well 
characterized  by  the  Jacobian 

D(\,E,x) 

1  (f.„  +  A32t'1(x))  ...  (*,„  +  A3"r,(x)) 

xj  (hn-E  +  \32V2ix))  ... 

—>  •  » 


As  shown  in  the  Appendix  (Theorem  5),  if  (£0,x0)  is 
a  solution  of  the  CE  in  a  point  a*,  and  if  a  correspond¬ 
ing  Jacobian  DfA^,  £„x()  is  different  from  zero,  then 
this  solution  can  be  analytically  extended  in  the  whole 
a  plane  with  the  exception  of  a  finite  number  of  points. 
The  solution  [£(a),x(a)|.  which  is  obtained  by  this 


process  of  analytic  extension,  we  call  normal.  Our 
main  concern,  however,  are  those  normal  solutions 
which  are  defined  in  the  point  A  -0,  which  corresponds 
to  the  Cl.  Such  solutions  we  call  regular.  They  can 
bridge  the  gap  between  Cl  and  CC,  since  they  can  be 
extended  out  of  the  point  A  -0,  to  the  whole  A  plane,  and 
provided  they  are  defined  in  the  point  A  =  1 ,  they  connect 
Cl  and  CC  solutions  in  a  natural  way.  The  normal  solu¬ 
tion  (£(A),x(A)j,  which  is  the  extension  of  the  solution 
(E0,x0)  in  the  point  A  =  A0,  is  by  the  very  definition  non¬ 
resonant.  Moreover,  it  is  an  algebraic  function  of  a, 
and  as  such  it  may  have  poles  and  branch  points.  As 
shown  by  Lemma  10,  if  A  =  A6  is  a  branch  point  of  a  nor¬ 
mal  solution  [£(a),.v(a';],  the  Jacobian  D(a4,  £(a»),x(a4)] 
should  be  equal  to  zero.  Also,  according  to  Lemma  8, 
vanishing  of  the  Jacobian  can  be  due  to  a  continuous  de¬ 
generacy  of  the  solution  at  this  point  (see  Definition  3). 

If  \  =  \d  is  such  a  point,  we  call  it  the  point  of  accidental 
degeneracy.  It  is  shown  in  the  Appendix  that  vanishing  of 
the  Jacobian  in  a  point  A  =  A '  indicates  instability  of  the 
normal  solution  [£(A'),x(A')].  Thus  in  a  branch  point  and 
in  the  accidental  degeneracy  point  the  corresponding 
solution  is  unstable.  It  may  also  happen  that  the  solu¬ 
tion  [£(A),x(A)]  is  unstable  in  some  point  a  =  a,  (i.  e. , 
the  corresponding  Jacobian  is  zero),  but  it  is  neither  a 
branch  nor  an  accidental  degeneracy  point.  In  this  case, 
we  say  that  A  =  A,  is  a  hidden  singularity  of  a  normal 
solution  [£(A),x(a)]. 

It  should  be  noted,  however,  that  although  [£(A),x(A)] 
is  not  defined  in  a  pole  A  =  A>(  the  corresponding  Jacob¬ 
ian  may  well  be  defined  in  this  point.  D(x)  *  D(a,  £(A),.v(A)) 
is  an  analytic  function  in  A,  and  there  is  a  possibility 
that  there  exists  an  analytic  continuation  of  this  function 
in  a  point  A  =  A,  which  is  a  pole  of  a  solution  [£(A),x(A)J. 

It  may  happen  that  the  Jacobian  vanishes  in  this  point; 
hence  poles  can  also  cause  disappearance  of  the 
Jacobian. 

In  conclusion  then,  if  [£(A),x(A)]  is  a  regular  (normal) 
solution,  it  may  have  four  types  of  singular  points. 

First  there  are  poles  which  are  those  points  where 
[£(A),x(A)]  is  not  defined.  Second  there  are  branch 
points,  third  there  are  accidental  degeneracies,  and 
fourth  there  are  hidden  singularities. 

!c)  Examples  of  singular  points 

The  above  possibilities  are  illustrated  in  Sec.  HI. 

The  regular  eigenfunction  >ts(A)  Eq.  (31)  has  a  pole  in  a 
point  a,  =  2 v/q  and  a  branch  point  in  A4  =  (if  *  8rr *) 

Ar’q.  In  addition,  there  is  a  hidden  singularity  in  a 
point  A„  =  -  4r:t>*/n!<7,  This  last  singular  point  can  be 
found  if  we  observe  the  corresponding  Jacobian,  which 
is 

D(A)  =  D[A,£(A),  *(A)1 

10)'  »•  0 

x,  -  £  0  0  0 

=  x3  Xffxs  \rjxt-E  Ar?.Vj  \qx2  .  i40) 

Vj  \gx j  A<7Xj  Af/Xj  -  £  Ayv. 

r5  0  0  0  -  £ 

Both  eigenfunctions  ♦(A)  in  Eq.  (31)  satisfy  v-.  =  \  ,-0 
and  ij  =  i,  =  \.  Hence 
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f}(X)=E3[E  +  2x(r-  Xq)].  (41) 

Thus  D(\)  =  0  if  either  £  =  0  or  £  +  2x(n  -  A</)  =:  0.  The 
first  possibility  yields 

£  =  0  -»  X,  =  X,  =  -  4v'v  */u1q,  X5  =  X,  =  2v/q.  (42) 

The  point  X,  is  a  pole  while  the  point  Xlf  being  neither 
a  pole  nor  a  branch  point,  is  a  hidden  singularity.  An¬ 
other  possibility  yields 

E  +  2x(ti  -  q)  =  0  »*  X  =  X,  =  (u!  +  8 vv’)/4vmq,  (43) 

which  is  a  branch  point. 

Concerning  the  eigenfunctions  4't,  ■kj.  and  *j,  Eq. 

(29),  one  finds  D(x)s  0  in  accord  with  Lemma  8.  Thus 
the  vanishing  of  the  Jacobian  is  due  to  a  continuous  de¬ 
generacy.  However,  no  point  X  is  a  point  of  accidental 
degeneracy,  since  the  above  eigenfunctions  are  de¬ 
generate  identically,  i.  e. ,  for  each  X.  Hence  they  are 
not  normal  eigenfunctions  (see  Theorem  5).  In  the  same 
way,  one  finds  D(\=u/v)  =  0  in  the  case  of  a  resonance 
solution  (37).  This  is  in  accord  with  Lemma  9. 

(d)  Physical  meaning  of  singular  points 

The  solutions  close  to  a  pole  or  a  branch  point  can 
obviously  have  no  physical  meaning.  Close  to  a  pole, 
for  example,  at  least  one  component  of  a  vector  x(x) 
tends  to  infinity.  This,  however,  means  that  the  opera¬ 
tor  V  cannot  be  treated  as  a  perturbation  anymore. 
However,  in  the  CC  equation,  V  is  introduced  as  an  ap¬ 
proximation  which  takes  into  account  only  some  of  the 
excited  states.  Hence,  if  a  pole  is  close  to  a  point  x  =  l, 
the  corresponding  regular  solution  is  not  of  much  value. 
Formally,  close  to  a  pole  a  state  x(x)  tends  to  be 
orthogonal  to  a  state  <fr,  and  thus  if  (x  I*)  is  small  for 
some  solution  (E,x)  of  the  CC  equations,  we  should 
regard  this  solution  as  poorly  representing  the  real 
state.  Hence  the  CC  formalism  is  good  only  for  a  few 
low-lying  states,  expecially  for  a  ground  state  which  is 
likely  to  have  significant  overlap  with  a  state  <fr. 

On  the  other  hand,  close  to  a  branch  point  we  have 
two  different  solutions  with  eigenfunctions  whose  over¬ 
lap  approaches  to  1.  However,  at  the  point  X  =  0,  those 
two  eigenfunctions  are  mutually  orthogonal  and  hence 
distinct.  Thus  if  operator  V  is  to  be  considered  as  a 
perturbation,  the  solution  close  to  a  branch  point  can 
also  have  no  physical  meaning. 

(e)  Reality  of  solutions 

The  question  of  the  reality  of  the  solutions,  especial¬ 
ly,  the  reality  of  the  eigenvalue  E,  is  answered  by 
Lemma  12.  Provided  Hamiltonian  H  is  real,  each 
regular  solution  (E(x),x(x)]  is  real  along  the  real  X 
axis,  as  long  as  we  don’t  meet  some  branch  point.  It 
means  that  if  there  is  no  branch  point  between  the  points 
X  =  0  and  x  =  1,  the  corresponding  solution  of  CC  equa¬ 
tions  (x  =  1)  should  be  real.  Note  that  the  essential  re¬ 
quirement  is  the  reality  of  the  Hamiltonian.  As  is  well 
known,  provtded  Hamiltonian  H  is  velocity  independent, 
there  exists  a  real  basis. 

It  is  interesting  to  note  that  the  reality  and  the  non- 
reality  of  eigenvalues  £  depends  to  some  extent  on  the 
basis  in  which  matrix  elements  of  the  Hamiltonian  H 


(and  hence  of  the  operator  IT  are  written.  Different 
orthonormal  bases  can  be  connected  by  unitary  trans¬ 
formations  and  eigenvalues  of  CC  equations  are  general¬ 
ly  not  invariant  with  respect  to  them.  This  can  be  illu¬ 
strated  using  as  an  example  the  CE  <27).  The  full 
Hamiltonian  of  the  system  described  by  this  equation  is 
a  six-by-six  matrix.  Observe  now  the  infinitesimal  uni¬ 
tary  transformation, 

U  —  1  +  i{A ,  =  nit  —  aS3  =  aS4  =  1 

otherwise,  af>  =  0,  i,j  —  1,2 . 6 

and  assume  that  H  is  real, 

=  l'n  =  hn  =  q-Q‘.  (45) 

To  first  order  in  t  the  transformed  CE  reads 

(u  0  v  v  0\  /-  2 kv  0  0  0  OV-trl  * 

0  0  0  0  o\  0  0  0  0  0  1, 

v  0  0  0  0  +  x(*jX5  +  XjX,',l  q  +nP  0  0  0  0  x3 

e  0  0  0  0  y  \q+iiP  0  0  0  0  J  x4 

0000  0/  \0  000  o/JL*,J 


where 

P  =  h8t=(6|H|6>. 

Due  to  (45)  the  eigenvalues  of  the  nontransformed  CE 
(27)  are  real  for  real  x  values  as  long  as  x  «  Xt.  This  is 
obvious  from  Eqs.  (31)  and  (32).  However,  the  eigen¬ 
values  associated  with  the  transformed  CE  (46)  are 
complex,  which  results  from  the  appearance  of  complex 
terms  in  the  operator  V. 

It  should  be  noted  that  the  eigenvalues  of  Cl  equations 
are  also  generally  not  invariant  with  respect  to  unitary 
transformations.  However,  in  the  case  of  Cl  equations, 
eigenvalues  remain  always  real,  while  in  the  case  of 
CC  equations  they  can,  as  shown  above,  assume  com¬ 
plex  values  as  well. 

Unitary  transformation  (44)  mixes  singly  excited 
states  13}  and  14)  with  the  doubly  excited  state  6>. 
However,  we  can  consider  unitary  transformations  of 
one-particle  type,  which  only  mix  occupied  and  un¬ 
occupied  one-particle  states  among  themselves.  Such 
transformations  express  new  creation  (annihilation) 
operators  as  linear  combinations  of  old  creation  (annihi¬ 
lation)  operators,  and  they  change  the  reference  state 
4>  only  up  to  the  phase.  Obviously  CC  equations  (11), 
and  hence  the  CE,  are  invariant  with  respect  to  those 
transformations.  Usually,  however,  a  velocity- inde¬ 
pendent  Hamiltonian  is  written  either  in  the  real  basis 
or  in  some  basis  which  can  be  connected  by  the  above 
type  of  unitary  transformation  with  a  real  basis.  We 
conclude  that,  provided  Hamiltonian  H  is  velocity  in¬ 
dependent,  eigenvalues  £  are  generally  real.  The 
reality  of  the  eigenvalue  can  be  violated  only  if  the  cor¬ 
responding  regular  solution  of  the  CE  has  a  branch  point 
of  the  real  x  axis  between  real  points  \  =  0  and  \  =  1. 
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ft)  Modified  characteristic  equation 

As  mentioned  above,  regular  solutions  of  the  CE  are 
our  main  concern,  since  only  those  solutions  can  con¬ 
nect  Cl  and  CC.  However,  a  given  CE  can  have  very 
few  or  no  regular  solutions  at  all.  In  order  that  a  solu¬ 
tion  [£(A), ■*(*)]  be  regular,  it  is  essential  that  the  cor¬ 
responding  Jacobian  is  not  identical  to  0.  This  is  a 
serious  limitation,  and  we  would  like  to  extend  the 
notion  of  a  regular  solution  to  as  many  solutions  of  the 
characteristic  equation  as  possible. 

This  can  be  done  by  a  slight  modification  of  the 
characteristic  equation.  Instead  of  Eq.  (38)  we  observe 
the  equation 

[tf  +  eA  +  \[V(x)-eA]}x  =  £x,  x,  =  1,  (47) 


where  operator  A  satisfies  requirement  (A45).  We  call 
this  equation  the  modified  characteristic  equation 
(MCE).  For  e  =0  it  reduces  to  the  usual  CE,  while  for 
a  =  1  it  represents  the  CC  equations.  Formally,  Eq. 

(47)  can  be  considered  as  a  characteristic  equation  of 
the  CC  method  and  a  "modified”  Cl  where  Hamiltonian 
ii  is  replaced  with  H’  =  H  +tA,  while  at  the  same  time 
operator  V  is  replaced  with  V'= 

All  the  conclusions  concerning  the  behavior  and  struc¬ 
ture  of  the  solutions  of  CE  do  apply  to  the  solutions  of 
MCE  as  well.  As  shown  in  the  Appendix,  for  small 
enough  «,  which  we  assume  to  be  the  case,  MCE  has 
exactly  in  regular  solutions  [£(x),x(x)).  Each  of  those 
solutions  can  have  a  finite  number  of  singular  points 
which  are  of  four  types:  pole,  branch  point,  accidental 
degeneracy,  and  hidden  singularity.  In  a  pole  the  solu¬ 
tion  (£(X),x(X)]  is  not  defined  and  at  least  one  compo¬ 
nent  of  a  vector  x(X)  blows  up.  Thus  the  pole  corre¬ 
sponds  to  an  eigenfunction  which  is  formally  orthogonal 
to  +.  In  each  neighborhood  of  a  branch  point  there  are 
at  least  two  solutions  which  smoothly  approach  each 
other.  In  an  accidental  degeneracy  point  there  are  at 
least  two  solutions  with  the  same  energy.  Finally,  if 
the  point  X  is  a  hidden  singularity  point,  none  of  the 
above  holds,  but  the  solution  [£(X),x(X)]  is  unstable  and 
the  corresponding  Jacobian  vanishes. 


Geometric  interpretation  of  solutions 


For  each  X  the  CE  is  a  set  of  m  equations  in  m  un¬ 
knowns.  Geometrically  we  can  think  ol  those  equations 
as  describing  m  hypersurfaces  in  an  w-dlmensional 
•Pace  .V".  Each  solution  is  then  a  common  intersection 
those  hypersurfaces.  Such  a  picture  can  help  us  to 
•nderstand  the  nature  of  resonance  points  and  different 
trpes  of  singular  points.  In  Fig,  3  are  shown  those  dif- 
•'rent  possibilities  and  their  geometrical  meaning.  This 
fvare  deals  with  the  simple  case  where  we  have  only 
hypersurfaces,  /,(X,£,x)  =  0  and  /-(X,  £,x)  =  0.  In 
t*.  3  those  hypersurfaces  for  a  given  x  value  are 
,f»*n.  Figure  3(a)  corresponds  to  a  crossing  of  all 
^>ersurfaces  in  a  given  point.  In  this  point,  D*  0  and 
11  is  a  nonresonance  solution.  In  Fig.  3(b)  the  two 
■Resurfaces  do  not  cross;  or,  if  one  prefers,  they 
ln  infinity.  Hence  there  is  no  solution.  However, 
Jacobian  still  can  be  defined  as  the  analytic  con¬ 
ation  of  the  Jacobian  in  some  neighboring  x  point. 


It 


D/O  X 
Regular  Solution 

(a) 


0*0  X 
Resonance  Pom! 

(d) 


Pole 

(b) 


o*o 

Degeneracy 


LX 


0*0  X 
Hidden  Singularity  or 
Branch  Point 

(e) 


FIG.  3.  Graphical  depictions  of  possible  solutions  as  intersec¬ 
tions  of  ni  hypersurfaces  m  m-dimensional  space,  and  their 
associated  values  of  the  Jacobian  D. 


In  Figs,  3(c)  and  3(d)  both  hypersurfaces  do  coincide 
which  corresponds  in  a  more  general  case  to  the  de¬ 
pendence  of  hypersurfaces.  In  the  former  case,  since 
£  is  always  constant,  we  have  degeneracy,  while  in  the 
latter  case  it  is  a  resonance  point.  Finally,  Fig.  3(e) 
can  describe  either  a  hidden  singularity  or  a  branch 
point.  In  order  to  distinguish  those  two  possibilities, 
we  should  draw  at  least  a  three-dimensional  diagram 
including  X  axes  as  well.  In  the  first  case  (hidden 
singularity)  the  two  hypersurfaces  should  only  touch  in 
some  point.  In  a  branch  point,  as  we  proceed  along  the 
x  axes  starting  from  a  point  of  touch,  we  should  con¬ 
tinuously  ai  rive  at  two  distinct  crossing  points  corre¬ 
sponding  to  two  solutions.  It  should  be  noted,  however, 
that  in  reality  two  axes  correspond  to  each  axis:  One 
for  a  real  part  and  another  for  an  imaginary  part.  Fig¬ 
ure  3  is  hence  only  approximate. 

V.  DISCUSSION 

Exploiting  the  analytic  connection  of  the  CC  with  the 
Cl  method,  we  have  come  to  the  following  main  con¬ 
clusions  concerning  the  solutions  of  CC  equations. 

(1)  Each  MCE  has  as  many  regular  solutions  as  the 
corresponding  Cl.  Here,  there  are,  in  general,  as 
many  solutions  to  the  CC  equations  as  the  number  of 
solutions  to  the  associated  Cl  equations.  The  particular 
Cl  solution  can  be  "lost”  only  if  the  point  x  =  1  is  a  pole 
or  a  branch  point  of  a  corresponding  regular  solution  of 
MCE. 

(2)  Unless  a  branch  point  exists  on  the  real  axis  in  the 
complex  x  plane  between  0  and  1  the  energy  eigenvalue 
will  be  real  for  a  real  Hamiltonian.  If  the  Hamiltonian 
matrix  is  complex,  the  energy  might  become  complex. 

(3)  Solutions  to  the  CC  equations  assoc  ated  with  high¬ 
ly  excited  states  tend  to  become  more  and  more 
orthogonal  to  ♦.  They  will  also  describe  these  states 
poorly.  Therefore,  it  becomes  more  likely  that  those 
excited  CC  solutions  will  show  singularities  close  to 

x  =  1.  Whether  this  causes  the  CC  solution  to  be  not 
defined  (x  =  1  is  a  pole)  or  has  a  complex  energy  value 
(branch  point  between  0  and  I  on  a  real  axis)  is  depen¬ 
dent  on  the  Hamiltonian  matrix  elements. 
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The  render  will  have  noticed  that  little  use  was  made 
of  the  actual  algebraic  form  of  Eqs.  (38),  peculiar  for 
the  CC  equations.  This  is  particularly  the  case  when 
discussing  the  occurrence  of  resonance  points,  poles, 
and  branch  points.  As  evidenced  by  Lemmas  6  and  9  in 
the  Appendix,  it  is  important  for  proving  the  reality  of 
the  solution,  but  only  to  the  extent  that 

V(x*)  =  V*(x). 

This  suggests  that  the  developed  procedure  of  analyti¬ 
cally  constructing  the  solutions  to  nonlinear,  algebraic 
equations  from  a  known  problem  (in  this  case  the  linear 
Cl  equations),  is  quite  general.  Thus,  with  some  modi¬ 
fication  this  method  can  be  applied  to  Hartree— Fock 
(HF)  equations  as  well.  The  modification  is  required 
since  HF  equations  are  not  analytic  in  the  unknowns. 
This  follows  from  the  fact  that  the  HF  potential  depends 
on  the  products  of  the  components  of  the  eigenvectors 

with  complex  conjugates  of  those  components.  How¬ 
ever,  this  difficulty  can  be  overcome,  and  essentially 
the  same  method  of  the  characteristic  equation  can  be 
applied.  One  finds  that,  if  HF  equations  describe  a 
system  consisting  of  S  particles,  and  if  there  are  Vf 
one-particle  states  (occupied  plus  unoccupied),  then  the 
characteristic  equation  corresponding  to  this  HF  has 
(J)  regular  solutions.  In  order  to  obtain  this  result,  it 
is  necessary  to  redefine  the  characteristic  equations 
and  the  definition  of  the  regular  solution  in  an  appro¬ 
priate  way.  This  will  be  done  elsewhere. 14 

APPENDIX 

Lemma  1:  Let  T  and  C  be  cluster  and  configuration 
operators  defined  by 


t„  c  =  2  c„ 

Ml  Ml 

where  7,  and  C„ 

*  *  1 


c.= 


2  c„i  ...J  ft,  <  •  •  * 


(Al) 


(A2) 


rr=2  7"/,-!  =2  (2  7" A */ i ! 

i.o  i.o  \/.>  )/ 

=  1  +  7,  +  (Tj  +  7;  '2)  -  (T,  +  7,  T,  -  7-J/6)  +  ••• 

=  i  +  £  IE  rr  £  6<si +  •  •  • +  *)  n  r 1.  (A6) 

j»i i  f«i  J 


where  the  last  expression  can  be  obtained  by  a  simple 
combinatorial  analysis.  Comparison  with  (A3)  proves 
(A4)  now. 

The  inverse  relation  (A4')  can  be  proven  along  the 
same  lines  using  the  operator  relation 


r=ln(l+C)  =  2  —  c‘. 

1.1  1 


(A7) 


(A7)  is  formally  valid  only  if  iCl  <1.  However,  if  the 
right-hand  side  in  (A7)  is  applied  to  any  state  4  consist¬ 
ing  of  a  finite  number  of  excitations  this  series  is  finite 
and  hence  (A7)  holds.  Thus  (A4')  deduced  from  (A7)  is 
valid  for  all  finite-dimensional  cases. 


Theorem  1 :  Let 
(♦£(*)! «'r(Tf-  £)er|4>)  =0, 
r=r,+...  +  r„  s=o,i . «, 


(A8) 


be  a  set  of  CC  equations  including  up  to  n  excitations. 
This  set  of  equations  is  equivalent  to  the  Cl- like 
equations, 


<*J(s)  I  (77  -  £){1  +  C  +  [C„.,  (n)  r  C„2{n)]}  \  *)  =  0, 
C=C,  +  -..+C„,  s=0.1 . n, 

where 

*♦1 

C..,(n)  =  2  £  5(Si  +  ' 

uZ  *  {*<) 

=  2  ~j~~  £  ®(si  +  1 

k.2  *  I*,) 


(A9) 


+  s»t«+  0  n  c 

i.i  * 

* 

■  +  sk9n+l)Ct  n  C3{  (A10) 


.  2  i-Jl 

m3  * 


2  6(s, 


+  s.,n-2)  n  C  . 

i.i 


are  operators  creating  linear  combinations  of  s-tuple 
p-h  excitations.  Further,  let  T  and  C  be  connected  by 
the  operator  relation 

er  =  l+C.  (A3) 

The  following  two  operator  relations  hold: 


C,  =  £  Jr  2  6(s,+-..  +  s»,s)  n  T 

».l  <#|)  i.l 

7>£^-  S9(s,  +  ...+s„s)  n  c,(. 

Ml  *  Is,  1  I.I 


(A4) 

<A4') 


In  Eq.  (A4)  and  (A4')  the  sum  extends  over  all 

possible  combinations  of  k  numbers  s,,  s2 . s,  such 

that  s,  +  Sj  +  ...  +  s,  =  s. 

Proof'.  Due  to  (A2)  we  have 


Operators  C,  and  7,  for  s  s  it  are  connected  by  the 
relations  (A4)  and  (A4'). 

Proof:  Let  the  space  of  states  consist  of,  at  most, 
.V-tuple  excitations.  Observe  (A3).  Formally  7,  =  C,=  0 
if  s  >  .V,  while  for  the  case  s  i  V  the  connection  between 
C,  and  7,  is  given  by  (A4)  and  (A4').  Now  let  T  termi¬ 
nate  after  the  nth  term  7  =  7,  +  7\  +  •  ■  •  +  7„,  and  let  us 
express  C,  in  terms  of  such  T.  In  order  to  denote  that 
C,  depends  on  n  we  will  write  Cs(i i).  From  (A4)  it 
follows  that  0,(111  =  0,  if  s  *  n.  Let  us  now  look  at  0,.,(»). 
This  operator  has  the  same  dependence  on  7,  as  the 
operator  C„  provided  we  put  7,.,  =0  in  (A4).  Hence, 
from  (A4), 

C,.|(n)  =  C„,  -  T„(.  (All) 

By  the  same  argument,  we  obtain 


tc„c,|=[r„r,i=o,  s,p  =  1,2, •  •  • 


(A5)  C„j(«)  =  C,.2  -  (7,.j  *  7,  7,.,). 


(A12) 


and  hence 


From  (A4), 
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(A21) 


i  r  =  c..,  *  E  E  5(s,  +  •••  +  s„»»  + 1)  n  c 

J  •  -I  K  («,i  i.i  1 


I 


:  c„.2  -  ctc x.i +  E  Mr"  E  ^(s,  +  •••  +  *»,«  +  2) 


nc 

i.i  * 


;fl  ■ 

i  jnd  hence  Eq.  (A10)  follows.  This  proves  that  (A8)  is 


(A13) 


I 


equivalent  to 

*l{s)\e'T(H  ~  £Hl  +c  •|-[C„.t(n)  +C„j(n)|}4)  =0, 

CsC,+- ••+C«  *»0, (AH) 

»here  C„.,(n)  and  C„2(n)  are  given  by  (A10).  Operators 
C^s,  Cmi,  etc.,  creating  (n  +  3)-,  (n  +  4)- tuple  excita- 
JnS  do  not  enter  into  (A14),  since  Hamiltonian  H 
destroys  at  most  two  excitations,  operator  expi-T")  can 
or.ly  create  excitations  and  the  state  <<t>2(s)l  contains  at 
c-.ost  >i  excitations. 

By  induction,  we  now  conclude  that  operator  exp(-  T) 
pan  be  omitted  in  (A14).  First,  observe  the  case  s  =0. 
Obviously, 

*>{1  +C  +  [C„.,(n)  +  C„.2(»)]}|*> 

*(*[(//-  £}{l  +  C  +  [C„.,  (n)  +  C„2  (n)  ]}  1 4-)  =  0,  (A1 5) 

jince  exp(-  T)  - 1  -  T  +  7*/2!  +  ■  •  •  can  only  create  ex¬ 
citations  and  there  is  no  excitation  in  ($1.  Second,  we 
have 

*;(l)|e-r(//-E){l+C  +  [C,.,(«)-t-C„2(n)]}|*> 

» <*J(1)  I  W -  E){\  +  C  +  +  C„.2(«)]}  | *) 

<*2tt>  I  TW -  E){l  +  C  +[C„t(n)  +  C„,2  (,.)[}  |  *)  »  0. 

(A16) 

Operator  T,  when  acting  on  any  ketl4<),  creates  at 
least  one  more  particle-hole  pair  in  it.  Therefore,  ac¬ 
cording  to  (A15)  the  second  term  in  (A16)  is  zero  and 
hence 

<+J (1)  I W  -  E){  1  +  C  +  [Cml(n)  +  C„2(n))} !  *}  =0.  (A17) 
By  induction  it  follows 

'.♦2(«>| W  -  E){1  +  C  [C„,(n)  +  C„.2(n)]}  |*>  =0, 

s  =  0, 1,...,».  (A18) 

This  set  of  equations  is  equivalent  to  (A14)  and  hence  to 
'A81.  Theorem  1  is  thus  proved. 

A  kind  of  inverse  theorem  also  holds: 

Theorem  2:  Let 

*J(s)!(h-£)(i+c)|+)=o, 

+  C„,  s  =  0, 1, ....  n 


C  =  C,  +  - 


(A3  9) 


he  a  set  of  Cl  equations  including  up  to  n-tuple  excita¬ 
tions.  This  set  of  equations  is  equivalent  to  the  CC-like 
^uations 


*i(s)|e'r'",(W-  E)eI’‘",|<t>>  =0,  s=0,l . n, 

f 

^ ‘here 

i 

T(»)  S  r,  +  •  •  -  +  r,  +  T'.t  (n)  *  T„t(n), 


(A20) 
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».2  *  •  (l,l  I.I  1 

E.2(»)  =  Et 7  E  S(s, +  •••  +  .«„, i  +  l)T,  1*1  T 
-  E  S  8(sj +  •■•  +  *»,» +  2)  n  r 

»-3  li,l  1.1  ‘ 


Operators  Ca  and  7",  for  s«n  are  connected  by  the  rela¬ 
tions  (A4)  and  (A4*).  The  above  theorem  can  be  proven 
along  the  same  lines  as  Theorem  1.  Both  theorems  ex¬ 
press  a  formal  symmetry  between  Cl  and  CC  equations. 


Observe  the  equations: 


<*2(s)|(ff-*Ki  +  c  +  -.-  +  c. 

+  x[C,.1(n)  +  C,.2(n)]}|4.)  =  0,  (A22) 

$30,1,..,  ,n. 

These  equations  explicitly  include  up  to  n-tuple  excita¬ 
tions  while  (>i  +  1)  and  ( n  2)-tuple  excitations  are  in¬ 
cluded  through  the  operators  C„,(n)  and  C„.:(n),  which 
depend  on  C,  to  C„,  according  to  (A10).  For  X  =  1  this 
set  of  equations  reduces  to  (A9),  hence  it  is  equivalent 
to  CC  equations  including  up  to  nth  order  clusters  Ts. 
For  A  =  0,  Eqs.  (A24)  reduce  to  a  usual  Cl  problem 
including  up  to  n-tuple  excitations. 

Lemma  2:  Equations  (A22)  can  be  written  in  the 
matrix  form 

[tf+XFU .)]*  =  £*,  **-1, 

where  H  and  V  are  matrices  with  elements 

»*a,W <*2I»I*S>, 

=  (4-*l«lCn.,(n)*C  *♦2  (n)]|*>6*.4.,oo, 
and  't  is  a  column  vector  with  components 


(A23) 


(A24) 


**a(s) = <*5(s)  I  c,  |  +> = ,  s = o,  1 . n. 

(A25) 

Lemma  2  follows  directly  from  (A22).  The  require¬ 
ment  4,()0  =  1  fixes  the  norm  and  phase  of  4.  Formally 
(A23)  is  an  eigenvalue  equation.  Note  that  according  to 
(A24),  operator  V  is  not  Hermitian,  and  hence  the 
reality  of  E  is  not  guaranteed.  We  will  call  Eqs.  (A22) 
and  (A23)  characteristic  equations  (CE).  We  shall 
analyze  the  solutions  (eigenvectors  and  eigenvalues)  of 
the  CE  as  a  function  of  x.  Besides  CE  (A23)  we  will 
sometimes  consider  the  equation 

{H  m  eA  a(  Vfq*)  -*Al}4  =  £*,  >k00  =  l,  (A23') 


where  A  is  a  real  Hermitian  operator.  Later  on  [see 
Eq.  (A45)|  we  will  impose  some  additional  requirements 
on  A.  We  will  call  Eq.  (A23')  the  modified  characteris¬ 
tic  equation  (MCE).  Obviously,  for«=0  this  equation 
reduces  to  CE.  Note  also  that  for  x  =  1  it  reduces  to  CC 
equations.  In  what  follows,  we  wiU  refer  mainly  to  CE. 
However,  all  the  conclusions,  unless  otherwise  speci¬ 
fied,  apply  to  MCE  as  well. 

We  now  first  examine  some  properties  of  the  operator 
1’  and  its  matrix  elements  r„4. 


r 
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Lemma  3:  Each  V 94  is  either  identically  zero  or  it  is 
a  polynomial  in  the  components  of  the  vector  4. 

(a)  All  matrix  elements  VS4(s),  corresponding  to  the 
s-tuple  excitations  where  s  <n~  1  are  identically  zero. 

(b)  If  there  is  no  (n  +  l)-tuple  excitation  l*4((»i  +  1)) 
such  that 

<*Z(n- l)|/f|*?(n+l))*0,  (A26) 

the  matrix  element  VRtt(n-  1)  corresponding  to  (n  -  11- 
tuple  excitations  14f(n-l)>  is  identically  zero.  Other¬ 
wise  V Rti(n  -  1)  is  polynomial  in  the  components  of  *  of 
degree  (n  + 1)  and  with  smallest  power  2. 

(c)  If  there  is  no  (n  +  l)-tuple  excitations  i4f.(n  +  l)> 
such  that 

<>fr£(/!)|tf|<&S(rt+l))*0  (A27) 

and  at  the  same  time  no  (n  +  2)-tuple  excitations 
i  +  2)>  such  that 

\4j(n)|tf|*?(»  +  2)>*0  (A28) 

matrix  element  VKA{n)  corresponding  to  n-tuple  excita¬ 
tions  is  identically  zero.  If  only  (A26)  holds, 

is  a  polynomial  in  the  components  of  4  of  the  de¬ 
gree  (n  +  1)  with  smallest  power  2  (same  case  as  <b)J. 

If  (A29)  holds  Fg4(n)  is  polynomial  in  components  of  4 
of  degree  (n  +  2)  with  smallest  power  2. 

Summarizing,  Lemma  3  states  that  unless  VKa  is 
identically  zero,  it  is  a  polynomial  of  4  components  of 
degree  at  most  (n  +  2)  and  at  least  2. 

Lemma  4:  Polynomial  PS4  contains  each  component  of 
4  with  power  of  at  most  1. 

Lemma  5:  If  the  Hamiltonian  H  is  real,  matrix  ele¬ 
ments  Psa(*)  satisfy 

=  (A29) 

Lemma  6:  H  the  Hamiltonian  H  is  real,  and  if  (E0,  4g) 
is  a  solution  of  CE  for  X  =  X0,  then  (EJ,4o*)  is  a  solution 
of  the  same  CE  In  the  point  X  =  Xf. 

All  these  Lemmas  follow  from  (A10)  and  (A24).  For 
example, 

-  1)  =<+*(«  -  1)  |  WlCMl(n)  +  C„.j(n)]|  *) 

=  E  E  ®(s,  +  •  •  •  +  s,,  n  +  1) 

*.2  *  fjf) 

x<*£(n  +  l)  n  C,.|*>.  (A30) 

(.1 

If  in  the  above  expressions  there  is  at  least  one  vector 
i+S’(n  +  l)>  such  that  relation  (A26)  holds,  the  term  in 
(A30)  corresponding  to  k  =«  +  1  gives  rise  to  the  highest 
power  (>i  +  1)  in  the  components  of  4.  The  lowest  power 
is  2,  corresponding  to  the  term  with  k  =  2  in  (A30).  In 
the  same  way,  matrix  elements  V nA(n)  can  be  analyzed. 
Lemma  4  follows  from  the  fact  that  nj  =0  for^each 
Fermion  annihilation  operator  o\ |.  Since  operators  C, 
can  only  create  particle- hole  pairs  and  never  destroy 
them,  once  such  pairs  are  created  the  repeated  applica¬ 
tion  of  the  same  p  -  h  creation  operators  on  the  new 


state  gives  zero.  Lemma  5  can  be  deduced  straight¬ 
forwardly  from  the  explicit  expressions  for  V such 
as  (A30),  and  from  the  assumed  reality  of  the  matrix 
elements  of  H.  Lemma  6  follows  from  Lemma  5.  The 
following  lemma  is  also  of  some  interest. 

Lemma  7:  Operator  V  is  either  identically  zero  or  it 
is  nonbounded. 

Proof:  Observe  first  that  according  to  (A23)  operator 
V  is  defined  only  for  'l'  such  that  400  =  1.  From  Eq. 

(A24)  it  follows  that 

(A31) 

Assume  now  that  V  is  not  identically  zero.  According  to 
Lemma  3,  there  is  such  that  VV4>  is  a  polynomial 

in  the  components  of  4'  of  the  degree  at  least  2.  Hence 


I  ^  I  "  Zr4  I  ^RA  '  ""  £r4  I  'J'Ra  I  2 


(A3  2) 


However,  I  I2  is  a  polynomial  of  at  least  fourth 

degree,  while  £ra  I4R4i2  is  a  polynomial  of  a  degree  2, 
and  hence  the  right-hand  side  in  Eq.  (A32)  is  not  bound¬ 
ed.  It  follows  that  V  is  not  bounded  either. 

Observe  that  although  V  is  not  bounded  it  is  continu¬ 
ous  since  its  matrix  elements  are  polynomial  (con¬ 
tinuous)  functions  of  the  components  of  a  vector  4. 

We  would  now  like  to  examine  the  properties  of  CE  as 
a  function  of  the  parameter  x.  Examples  given  in  Sec. 

IH  suggest  the  following  definitions: 

Definition  1:  Point  X,  in  a  complex  x  plane  is  a  reso¬ 
nance  point  Of  a  CE  if  for  X  =  X,  the  CE  has  an  infinite 
number  of  eigenvalues. 

Definition  2:  Point  X„  in  a  complex  X  plane  is  a  non- 
resonance  point  of  a  CE  if,  for  x  =  x„  the  CE  has  a 
finite  number  of  eigenvalues. 

The  emphasis  in  Definitions  1  and  2  is  on  the  number 
of  eigenvalues  and  not  on  the  number  of  eigenfunctions, 
which  can  be  infinite  due  to  degeneracy.  It  is  clear  that 
X  =  0  is  a  nonresonance  point. 

We  now  prove  the  following  theorem. 

Theorem  3:  Let  x,  be  a  resonance  point  of  a  CE.  Each 
value  E,  except  a  finite  number  of  them,  is  an  eigen¬ 
value  of  the  CE. 

Proof:  In  order  to  prove  this  theorem,  it  is  more 
convenient  to  renumerate  the  components  4B4  of  a  vec¬ 
tor  4  in  such  a  way  that  we  first  take  component  400  =  1 , 
then  in  some  definite  order  all  components  correspond¬ 
ing  to  single  excitations,  then  all  components  associated 
with  double  excitations,  etc. ,  up  to  (/-tuple  excitations. 
We  denote  the  renumerated  components  by  -vf,  i 
=  l,...,'n.  With  this  enumeration  understood,  we 
write,  instead  of  *,  the  vector*.  Thus  (A23)  reads 

[H  +  XF(x)]  =Ex,  *,  =  1.  (A33) 


where  H  and  V  are  »i*m  square  matrices  with  elements 
given  by  (A24),  using  the  correspondence  4Si  —  v(. 

The  matrix  equation  (A33)  is  a  system  of  m  equations  in 
m  unknowns: 
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(A3  4) 


ber  of  them.  But  then,  since  ft  arc  polynomials,  (A37) 
should  be  satisfied  for  all  a  with  (lie  exception  of  a 
finite  number.  This  proves  Theorem  4. 


'i  ;  ;'ti  *  u\t'z  *  •  •  •  +  *  W\{x)-  £-  0, 

h  5  //:i  +  +  •  •  •  +  /j..v.  *  A I ;(v)  -  Ev  =  0, 

0  /,  z,!«i+  +  •  •  ■  -*■  -1-  A  I'„ (v)  -  Ex„  =  0, 

where  the  unknowns  are  £,x, . x„.  Each  polynomial 

/,  can  be  uniquely  factorized  in  a  set  of  t(f)  irreducible 

polynomials//',  ;,  =  1 . f?(i).  For  example,  the 

polynomial 

/=X?  -  x3  *X,x5  -  xfc,  +x2  -  x,=  0 
can  be  factorized  as 

f=  (xf  -x?-  IKxj-x,)  =  0. 

Hence  the  set  (A34)  is  equivalent  to  A'si"i,  «(i)  sets  of 
equations 

//■  =  0,  /,  =  X,  2, . . .  ,*(1), 

fz-  =  0,  j,  =  l,2 . *(2),  (A3 5) 

fim  =  0,  ;„  =  1, 2 . k(w). 

Each  equation  in  (A35)  represents  a  hypersurface  in  the 
or-dimensional  space  A"*  spanned  by  £,x,, . . .  ,xm. 
Geometrically,  the  solution  of  (A34)  is  the  joint  inter¬ 
section  of  all  hypersurfaces //'  •  •  Assume  now 
that  the  point  A  =  A,  is  a  resonance  point  of  (A34).  In 
this  case,  there  is  at  least  one  set  (A35)  which  has  an 
infinite  number  of  intersections  in  A™  with  different 
values  of  E.  Due  to  the  algebraic  structure  of  the  //' 
this  can  happen  only  if  this  set  is  dependent.  But  this 
means  that  it  can  be  solved  step  by  step  to  lead  finally 
to  an  equation  of  the  type 

.*(-! <*(),*!,*,.,(*,) . *„(*()]  =  0,  (A3 6) 

which  is  algebraic  in  E, x3, , . .  ,x„,  where  all  x,(x()  are 
algebraic  inx,.  Therefore  (A36)  is  algebraic  in  E  and 
r,,  and,  being  satisfied  for  a  denumerable  number  of 
distinct  E  values,  it  is  satisfied  for  each  £  except, 
possibly,  for  a  finite  number  of  them.  This  proves 
Theorem  3. 

Related  to  this  Theorem  is  the  following  theorem. 

Theorem  4:  Each  CE  has  either  a  finite  number  of 
resonance  points,  or  each  A  is  a  resonance  point  except 
for  a  finite  number  of  them. 

This  theorem  is  proven  along  the  same  lines  as  the 
preceding  one.  Assume  namely  that  a  given  CE  has  an 
infinite  number  of  resonance  points.  From  this  infinite 
number,  we  can  extract  a  denumerable  subset  of  them. 
According  to  Theorem  3,  for  each  resonance  point  the 
CE  is  satisfies  for  all  £  except  for  a  finite  number  of 
them.  It  follows  that  for  a  denumerable  number  of 
resonance  points  there  is  a  set  consisting  of  all  £  ex¬ 
cept  for,  at  most,  a  denumerable  number  of  them. 
However,  due  to  the  algebraic  character  of  the  CE 
exclusion  can  apply  only  to  a  finite  number  of  them. 

This  means  that  the  equations 

/,(A,£,x)  =  0  (A37) 

*r*  satisfied  for  a  denumerable  number  of  A  values 
*nd  for  all  £  with  the  possible  exception  of  a  finite  num- 


It  is  very  likely  that  the  number  of  resonance  points 
can  only  be  finite,  meaning  that  the  second  possibility 
in  Theorem  4  does  not  apply  for  realistic  cases.  How¬ 
ever,  we  were  unable  to  prove  that. 

Besides  resonance  points,  the  CE  can  also  have 
singular  points.  In  order  to  define  these  points,  we  first 
introduce  the  Jacobian  of  the  CE  which  is  defined  for 
a  given  A,  £,  and  x, : 


D( \,x, E) 


3mft 

?7„  ’ 


(A3  8) 


where 


—  ,  i=2  ,  ...,ra. 


In  the  case  of  CE  we  have 

1  (11,2  +  A82F,)  ...  (/,,„  +  \d~VJ 

x,  (A,j  -  £  +  ...  (h.mT  \dmV,) 

D(A,x,£)  =  -  • 

*,  <*„2  +  A  3lvm)  ...»,„_£  + A8-VJ 

(A38') 


and  similarly  for  the  MCE.  Now  the  following  theorem 
can  be  proven. 


Theorem  5:  Let  the  CE 

[H  +  Wt'X)\x  =  Ex  (A3  9) 

for  A=A„  have  a  solution  (£0,x0).  Further  let  the  cor¬ 
responding  Jacobian  be  different  from  zero: 

Xl(Aj,  Eq.Xq)  *  0.  (A40) 

Then  there  is  E(\)  and  x(A)  such  that: 


1.  £(A)  as  well  as  each  component  of  x(A)  are  analy¬ 
tic  functions  in  the  whole  complex  A  plane,  with  possible 
exceptions  of,  at  most,  a  finite  number  of  points. 

2.  [£(A),x(A)l  is  a  solution  of  a  CE  (A39)  and  for 
A  =  Aq  it  concides  with  (£0,x0). 

This  solution  [E(A),x(A)]  we  shall  call  a  normal  solu¬ 
tion.  If  [£(A),x(A)]  is  defined  in  the  point  A  =  0  we  shall 
call  it  regular.  Also,  eigenvalue  £(A)  and  eigenfunction 
x(A)  we  shall  call  a  normal  (regular)  eigenvalue  and  a 
normal  (regular)  eigenfunction,  respectively.  Note  that 
by  the  very  definition  for  each  normal  solution  there  is 
at  least  one  point  A  such  that  the  corresponding  Jacobian 
is  different  from  zero. 


Proof :  The  proof  of  the  above  Theorem  5  is  based  on 
the  following  Theorem  6. 

Theorem  6:  Let  ?(x,_v)  be  continuous  for 

vector  x  in  a  neighborhood  of  x0  in  and  for  vector  y 
in  a  neighborhood  v0  in  £„,  with  g(x0,  v0)  =  0.  Suppose.? 
is  continuously  differentiable  in  y  and  that  the 
determinant 


(A41 ) 
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Then  there  is  a  neighborhood  .V, (v0)  in  and  a  neighbor¬ 
hood  .V.(v„)  in  £,  such  that  for  every  i  in  V,  there  is  a 
unique  v  ==  o’  tv )  in  ,V;  for  which  ,'(v,  y’(.vl|  =  0.  [f  -  (.v ,  v) 
is  l times  continuously  differentiable  in  v  and  v,  then 
«H*)  is  A-  times  continuously  differentiable  (A-  ?  1). 

The  above  theorem  is  proven  in  Ref.  15  for  the  case 
of  real  spaces  and  /?„.  However,  it  holds  for  the 
case  of  complex  spaces  as  well,  which  can  be  proven 
along  the  same  lines. 

Putting  x  =  X  and  v a  (E,x)  the  above  theorem  can  im¬ 
mediately  be  applied  to  the  CE.  Since  I'Cv)  is  analytic 
in  some  neighborhood  of  a  point  r  =.t0,  £(x)  and  x(\) 
are  analytic  as  well  in  some  neighborhood  .V(x0)  of  a 
point  X  =  \0  where  a  solution  exists.  Due  to  the  algebraic 
character  of  the  CE  this  solution  shall  be  algebraic  as 
well,  and  we  can  make  an  analytic  continuation  of  the 
function  £(,\)  and  jt(X)  in  the  whole  X  plane.  Since  these 
functions  are  algebraic  in  a  small  neighborhood  .VfA^), 
they  are  algebraic  in  the  whole  X  plane  as  well.  We 
have  only  to  show  that  [E(A),jc(A)]  is  a  solution  of  CE. 

In  .V(Xo)  the  solution  i£(A),.r(x)]  satisfies 

/.-[X,  £(x),w(X)]  =  0.  (A42) 

Since  £(A)  and  .v(X)  are  analytic  in  X,  and  since  f,  is 
analytic  in  A  £  and  x,  it  is  analytic  in  X  as  well.  On 
the  other  hand,  /,  is  identically  zero  in  .V(x0)  and,  being 
analytic  in  X,  should  therefore  be  identically  zero  for 
all  X.  This  proves  that  [£(X),w(X)]  satisfies  CE  for  each 
X  where  it  is  defined.  However,  £(A)  and  x(X)  are  alge¬ 
braic  functions  of  x;  hence  they  are  defined  for  each  X 
with  a  possible  exception  of  a  finite  number  of  poles 
and  branch  points. 

It  is  interesting  to  see  the  meaning  of  the  require¬ 
ment  (A40)  in  the  above  theorem.  For  that  purpose,  we 
use  the  following  geometric  interpretations: 

Each  equation /jiXo,  £,x)  =  0  for  a  given  X  =  x0  repre¬ 
sents  a  finite  number  of  hypersurfaces  in  the  m-dimen- 
sional  space  Am  spanned  by  £  and  x3, . . .  ,r„.  The  vector 
£(■*)  =  v3’/),  32/i; ....  3mft]  is  orthogonal  to  the 
hypersurface  in  the  point  (E,x).  The  Jacobian  (A38) 
consists  of  vectors  V/,,  V/;, , . . ,  V/„  and  hence  the  van¬ 
ishing  of  this  quantity  implies  a  linear  dependence  of 
these  vectors.  D*  0  thus  means  that  V/,  to  Vfn  are  not 
linearly  dependent.  If  the  point  (E,.*,,)  satisfies 
/j(Ao,£0,.v0)  =  0  it  should  lie  on  a  common  intersection 
of  all  hypersurfaces  /,  =  0.  Moreover,  if  the  correspond¬ 
ing  V/,  are  linearly  independent,  the  vectors  ortho¬ 
gonal  to/,  =  0  are  linearly  independent.  Hence  there  is 
some  neighborhood  of  point  (£0)x0)  in  A'"  where  there 
is  no  other  common  intersection  of  hypersurfaces  /, 

=  0.  Thus  the  nonvanishing  of  the  Jacobian  [Eq.  (A40)) 
implies  that  the  solution  (£0,.v0)  should  be  isolated  in 
the  space  A'’*. 

Definition  3:  Let  (£o,w0)  be  a  solution  of  the  CE  in 
the  point  A  =  Aa.  Further,  in  each  neighborhood  of 
(E,,.r„)  let  (Eij.xJ)  *  (£0,x0)  exist  such  that  it  is  a  solu¬ 
tion  of  the  CE  for  A  =  A,  as  well.  We  say  that  (£0,x0)  is 
a  continuously  degenerate  solution  of  the  CE  in  the  point 
X  =  x^. 

According  to  the  definition,  a  continuously  degenerate 
solution  is  not  isolated.  Also,  Eq.  (A36)  implies  that  a 


resonance  solution  cannot  be  isolated.  Hence,  and  in 
connection  with  the  above  conclusion,  we  did  prove  the 
following  Lemmas. 

Lemma  8:  if  (Ea,xa)  is  a  continuously  degenerate  solu¬ 
tion  of  the  C E  for  V  =  X1,  the  corresponding  Jacobian 
D(\a,  Eatxa)  is  equal  to  zero. 

Lemma  9:  If  (Er,xr)  is  a  resonance  solution  of  the  CE 
in  a  resonance  point  A  =  x,,  the  corresponding  Jacobian 
D[xrt  E(xr),*(\,)]  is  equal  to  zero. 


Lemma  10:  If  X  =  x6  is  a  branch  point  of  a  normal 
solution  [£(A),.v(x)],  the  corresponding  Jacobian 
Dt\h,  EfA^J.-vlAs)]  is  equal  to  zero. 

N'amely  if  D(x,)*0,  by  Theorem  5  the  solution 
[£(.>),. v(A)]  should  be  analyt  ic  in  the  point  A  =  As  which 
contradicts  the  assumption  that  this  is  a  branch  point. 

The  inverse  is  not  true:  Vanishing  of  the  Jacobian 
does  not  necessarily  mean  that  the  point  A  =  X'  is  either 
a  branch  point,  a  resonance  point,  or  a  point  of  conti¬ 
nuous  degeneracy.  The  above  geometrical  interpreta¬ 
tion  implies  that  if  D( A0)  =  0  the  solution  (£„,.va)  should 
be  unstable.  Namely,  if  V/(  are  linearly  dependent, 
the  point  ( £" 0 , -v 0 )  is  the  common  intersection  of  all  /,  =  0 
to  at  least  first  order.  Hence  there  are  some  vectors 
(£,.r)  in  a  space  A'"  which  differ  to  the  first  order  from 
(E0,*0),  but  also  satisfy  the  CE  to  at  least  second  order. 
Note  that  by  the  very  definition,  continuously  degenerate 
solutions,  solutions  in  a  branch  point,  and  resonance 
solutions  are  unstable  as  well. 


Theorem  5  now  provides  a  basis  for  the  definition 
and  classification  of  different  types  of  singular  points. 
Intuitively,  each  point  where  the  normal  solution 
£(A),x/A)I  is  not  analytic,  or  is  unstable  due  to  any 
reason,  we  consider  as  singular.  Thus  singular  points 
are  singular  points  of  a  given  normal  solution 
[£(X),w(X)],  and  each  normal  solution  can  have  a  dif¬ 
ferent  set  of  singular  points.  On  the  other  hand,  reso¬ 
nance  points  are  characteristic  of  the  CE  as  a  whole. 
We  can  have  the  following  singular  points: 

1.  Point  A  =  Xf  where  solution  i£(A),x(A)j  is  not  de¬ 
fined.  Since  [£(x),x(A)l  is  algebraic  in  x,  such  a  point 
is  a  pole.  As  we  approach  X  =  X,  at  least  one  component 
of  a  vector  x(x)  tends  to  infinity.  Hence 


lim 
»-  »0 


(*(A)l») 

['*(A):*(A)>]l'J 


=  0. 


(A43) 


This  means  that  x{\)  is  more  and  more  orthogonal  to  $. 
No  vector  orthogonal  to  4>  can  be  a  solution  to  CE. 

2.  Point  X  =  X,  which  is  a  branch  point  of  the  solution 
[£(X),x(A)].  Obviously,  if  At  is  a  branch  point,  £>(\J 
should  be  equal  to  zero.  If  it  were  not,  there  would  ex¬ 
ist,  according  to  Theorem  6,  a  unique  analytic  conti¬ 
nuation  of  the  solution  (E(A#),.ic(Ab)]  in  some  neighbor¬ 
hood  of  this  point,  which  is  contrary  to  the  definition 
of  a  branch  point.  We  call  such  a  singularity  a  branch 
singularity. 

3.  Point  A  =  x<  where  the  normal  solution  f£(X),.t(X)] 
is  continuously  degenerate.  According  to  Lemma  8, 
Jacobian  D(x)  vanishes  in  this  point.  We  say  that  A  =  x* 
is  a  point  of  accidental  degeneracy. 
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4.  Point  a  =  a„  where  the  normal  solution  [£(a),x(a)| 
is  defined,  the  corresponding  Jacobian  is  zero  but  the 
vanishing  of  the  Jacobian  is  not  due  to  any  of  the  former 
reasons  (branch  point  or  accidental  degeneracy).  Such 
a  point  we  call  a  point  of  a  hidden  singularity.  As  shown 
f  above,  the  solution  [E(a),x(a)1  is  unstable  in  this  point. 

In  short  A  =  Aq  is  a  singular  point  of  a  normal  solution 
( H* ( A ) , a' ( a ) f  if  either  [£(A),x(X)]  is  not  defined  in  this 
point  (pole)  or,  provided  [£(A),a(A)]  is  defined  in  A  =  A$, 
the  corresponding  Jacobian  vanishes  (branch  point,  ac¬ 
cidental  degeneracy,  and  hidden  singularity).  Due  to 
the  algebraic  character  of  the  solution  (E(A),a(a)]  and 
Jacobian  0(A),  there  can  be  only  a  finite  number  of 
singular  points.  If  it  were  not  so>  would  be  iden¬ 
tically  contrary  to  the  assumption  that  [£(A),x(A)] 

Is  a  normal  solution. 

The  above  definition  of  singular  points  applies  only 
to  the  normal  solution  [EM,  x(A)[,  i.  e.  ,  such  that  there 
\  =  a' with  the  property  0(A')  *  0.  However,  we 
K  lve  n0  guarantee  that  in  any  point,  including  A  =  0, 
there  exists  a  solution  with  nonvanishing  Jacobian.  In 
order  to  overcome  this  limitation,  instead  of  the  CE 
(A23)  we  observe  MCE  (A23').  We  chose  a  real  Hermi- 
tian  operator  A  in  such  a  way  that  no  eigenfunction  of 
the  equation 

.Ax=ax  (A44) 


is  orthogonal  to  4>,  and  that  all  corresponding  Jacobians 
are  different  from  zero.  In  other  words,  A  satisfies 


x)  =  - 


(x !  <S>)  *  0, 

!  1  al! 
xi 


and 


/ 

a*l 


*0, 


E 


(A45) 


a  n  —  aTt  —  a  i 


Obviously  such  an  operator  always  exists  which  can  be 
demonstrated  by  construction.  We  now  prove  Lemma 
11. 


Lemma  11:  There  exists  «0*0  such  that  if  l«l  and 
e*0  no  eigenfunction  of  the  equation 

( H  +  iA)x  =  Ex  (A46) 

is  orthogonal  to  <fr,  and  all  corresponding  Jacobians  are 
different  from  zero. 

Proof:  If  €  is  big  enough  Eq.  (A46)  is  close  enough  to 
Eq.  (A44)  and  since  A  and  H  are  bounded  operators, 
each  solution  of  Eq.  (A46)  is  close  enough  to  some 
solution  of  Eq.  (A44),  and  vice  versa.  Hence  for  big 
enough  e,  no  solution  of  (A46)  is  orthogonal  to  and  all 
corresponding  Jacobians  are  different  from  zero.  How¬ 
ever  Eq.  (A46)  has  the  structure  of  a  CE  and,  by  Theo¬ 
rem  5,  all  those  solutions  are  analytic  in  «,  with  the 
possible  exception  of  a  finite  number  of  «  points.  It  fol¬ 
lows  that  overlaps  (x(e)l<t)  are  analytic  in<,  and  hence 
can  become  zero  only  in  a  finite  number  of  points.  The 


same  holds  true  for  the  corresponding  Jacobians.  Hence 
there  is  a  small  cirrle  in  the  complex  <  plane  around  a 
point  i  =0  where  no  solution  of  Eq.  (A46)  is  orthogonal 
to  4>  and  where  all  the  corresponding  Jacobians  are  dif¬ 
ferent  from  zero,  with  the  possible  exception  of  the 
point  £  =  0.  This  proves  Lemma  11. 

We  will  consider  the  condition  <  *  0  and  c  '  r0  as  a 
part  of  a  definition  of  MCE.  Also,  without  loss  of  gen¬ 
erality,  e  can  be  assumed  real.  With  that  in  mind,  and 
in  combination  with  Theorem  5,  the  above  Lemma 
proves  the  following  Theorem  7. 

Theorem  7:  The  modified  characteristic  equation  has 
m  regular  solutions,  i.  e.  ,  the  same  number  as  the 
corresponding  Cl.  This  means  that  in  each  point  A  =  A' 
an  MCE  has  m  solutions  which  are  analytically  con¬ 
nected  with  m  solutions  of  the  Cl  equations  tpoint  A  =  0) , 
unless  A=  A'  happens  to  either  be  a  pole  or  a  branch 
point  of  some  regular  solution. 

We  can  now  state  something  about  the  reality  of  the 
solutions  [E(A),x(A)]. 

Lemma  12:  Let  the  Hamiltonian  H  be  real  and  let 
[£(A),x(A)]  be  a  regular  solution  of  the  CE.  Assume 
furthermore  that  [£(X),  x(\)]  has  no  branch  point  on  the 
real  axis  between  the  real  points  A  =  0  and  A  =  \<>.  Pro¬ 
vided  A,  is  not  a  pole,  the  solution  [^(A^) ,  x(Xo) ]  exists 
and  it  is  real. 

Proof:  Since  [£(a),x(X)]  is  regular,  it  is  defined  in 
A  =  0.  Since  H  is  real,  [£(0),x(0)]  will  be  real.  We  can 
now  reach  \  =  A5  starting  from  A  =  0  along  some  path  P 
in  the  complex  A  plane.  P  can  always  be  chosen  such 
that  it  avoids  all  possible  singular  points,  and  that  the 
area  enclosed  by  P  and  the  real  axis  does  not  contain 
branch  points  [see  Fig.  4],  Along  P,  we  have  the  solu¬ 
tion  [E(X),x(A)]  which  smoothly  changes  from  [E(0),x(0)] 
to  (£(A0),x(A<))].  Now  we  can  reach  A,  as  well  along  path 
P*  which  is  a  complex  conjugate  of  P.  From  Lemma  6, 
and  because  there  are  no  branch  points  between  P  and 
the  real  axis,  there  is  no  branch  point  between  P *  and 
the  real  axis  either.  Therefore,  there  is  no  branch  point 
in  the  whole  area  enclosed  by  P  and  Pm.  Hence  the  an¬ 
alytic  continuation  of  [£(0),x(0)]  along  either  path  will 
yield  the  same  solutions  in  the  point  A„.  However,  from 
Lemma  6  and  the  reality  of  [£(0),x(0)j,  it  follows  that 

[£(A),x(A)]  =  [E*(A*),x*(A*)] 
and  hence,  for  A  =  A0  =  A0*, 

[£(A0).x(A<,)]  =  [£*(A0),x*(A1,)]  (A47) 

expressing  the  reality  of  the  solution. 


FIG.  4.  Paths  P  and  P *  in 
the  complex  A  plane  con¬ 
necting  a  configuration-inter¬ 
action  solution  IX  =  0)  with  that 
in  point  A  =  A,  on  the  real  axis. 
Solution  (£(Afl),  t(Vq) ]  is  real 
provided  there  is  no  branch¬ 
point  on  the  real  axis  between 
0  and  Aq. 
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Note  in  the  above  proof  that  the  condition  that  there 
is  no  branch  point  between  \  =  0  and  A  -  on  the  real 
axis  is  essential  since,  if  such  a  point  would  exist,  the 
analytic  continuations  along  P  and  P’  would  give  two 
different  solutions  — say,  (£,(A0),  v,(X0))  and  (£,(A0),  .v,(A0)]. 
All  we  could  then  infer  is  that  £,(A0)  =  and  v , ( ) 

—  xi  (A0).  However,  the  absence  of  poles  between  0  and 
A0  is  not  essential. 
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The  density  of  stales  (DOS)  of  extended  systems,  calculated  at  the  ngorous  Hartree-Fock  (HF)  level,  can 
have  a  number  of  unphysica!  features.  It  is  shown  analytically  that  in  partially  filled  band  systems  (crystals, 
thin  films,  polymers)  the  HF  DOS  vanishes  at  the  Fermi  energy,  regardless  of  Fermi-surface  shape.  HF 
DOS  will  also  vanish,  in  the  (rare)  event  that  an  equienergetic  surface  S  for  an  energy  different  from  the 
Fermi  energy  coincides  with  the  Fermi  surface  Sr  Additional  features  such  as  shoulders,  peaks,  or  near 
gaps  can  occur  it  energies  with  surfaces  S  close  to  SF  No  KF-related  zero  DOS  arises  in  filled-band 
extended  systems.  Published  HF  DOS  are  discussed.  A  detailed  summary  of  expressions  for  crystal  HF 
matrix  elements  in  momentum  representation  is  given.  Their  modification  for  thin  films  and  polymers  is 
indicated. 


I.  INTRODUCTION 

Since  1969  several  calculations  have  appeared  in 
the  literature  that  aim  towards  obtaining  rigorous 
Hartree-Fock  (HF)  results  for  crystalline  solids. 
These  calculations  differed  from  previous  work  to 
the  extent  that  full,  nonlocal  exchange  was  taken 
into  account,  and  that  integrals  were  calculated 
accurately  or  approximated  carefully.  Because 
only  limited  basis  sets  were  used  in  Bloch  orbital 
expansions  no  calculation  can  be  considered  to  have 
reached  the  HF  limit.  Nevertheless,  there  are  in¬ 
dications  that  several  have  come  pretty  close. 

Two  fundamentally  different  approaches  to  rigor¬ 
ous  HF  have  been  used.  One  of  these  can  be  char¬ 
acterized  as  a  configuration-space  (CS)  approach, 
since  the  HF  matrix  elements  are  expressed  as 
real  (or  direct)  lattice  sums  over  multicenter  inte¬ 
grals.  A  formulation,  using  Adaras-Gilbert  local 
orbitals,'  was  first  given  by  Kunz.3  Applications 
were  made  to  the  optical  properties  of  solid  rare 
gases,1  alkali  halides.'  and  lithium  hydride.5  The 
linear-combination-of -atomic -orbitals  (LCAO) 
version  of  this  approach  was  applied  to  diamond,6 
HF,  and  Ne.7  More  recently,  the  CS  approach 
was  applied  to  some  '‘open-shell”  solids  (with  par¬ 
tially  filled  bands)  such  as  calcium,*  lithium,*  and 
TiO.10 

The  other  approach  can  be  described  as  a  mo¬ 
mentum-space  (MS)  approach,  because  recipro¬ 
cal  lattice  summations  are  used  to  compute  HF 
matrLx  elements.  First  introduced  by  Harris  and 
Monkhorst,"  the  MS  (or  Fourier)  method  has  also 
been  used  for  diamond  by  Mauger  and  Lannoo,13 
employing  Slater-type  orbitals  for  atomic  basis 
functions.  Very  recently  the  MS  approach  has 
been  analyzed  by  Cox  and  Fry13  and  Fry  et  al ." 
(hereafter  referred  to  by  FBBi.  Applications  to 
properties  of  several  simple  metals  have  also  ap¬ 
peared.15 


The  CS  and  MS  formulations  are  totally  different 
in  their  analytic  and  computational  details,  al¬ 
though,  when  carried  to  their  limits,  the  two  ap¬ 
proaches  are  identical.  We  already  indicated  that 
the  CS  and  MS  methods  are  based  on  direct  and 
reciprocal  lattice  summations,  respectively.  This 
difference  has  several  consequences  of  great  prac¬ 
tical  import.  To  name  a  few: 

(i)  The  CS  approach  is  most  appropriate  for  in¬ 
sulating  crystals,  whereas  the  MS  approach  con¬ 
verges  best  for  conductors. 

(ii)  The  Madelung-type  conditional  convergence 
problem  that  troubles  the  CS  method  is  eliminated 
in  the  MS  method  through  a  rigorous  cancellation 
of  singwlarities.  As  a  result,  total  HF  energies 
with  the  CS  method  can  oscillate  seriously  with  as¬ 
sumed  unit  cell  cluster  size  (implicit  ui  the  real 
lattice  sum  truncation),  whereas  in  the  MS  method 
these  energies  are  stable. 

(iii)  The  most  significant  difference  is  that  the 
CS  method  requires  the  calculation  and  manipula¬ 
tion  of  horrendous  numbers  of  many -center  inte¬ 
grals,  particularly  when  Gaussian-type  orbitals 
are  used.  The  MS  method,  however,  allows  the 
reductions  to  reciprocal  lattice  sums  involving 
Fourier  transforms  of  basis  functions  only.  Con¬ 
sequently,  when  n  orbitals  occur  per  unit  cell,  the 
computing  effort  of  the  CS  method  scales  like  n\ 
and  the  MS  method  scales  more  like  n3. 

In  physical  terms  one  can  say  that  the  CS  ap¬ 
proaches  emphasize  the  calculation  of  the  total 
wave  function  and  charge  distribution,  whereas  the 
MS  approach  emphasizes  the  deviation  from  a  uni¬ 
form  charge  distribution  and  associated  wave  func¬ 
tions.  It  can  thus  be  understood  that  the  methods 
have  different  ranges  of  applicability. 

The  successes  of  the  crystal  HF  calculations  are 
based  on  the  precise  definition  of  the  HF  method. 
This  situation  allows  us  to  draw  well-founded  con¬ 
clusions  on  the  correlation  effects  in  band  widths, 
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gaps,  and  density  of  states  (DOS).  Moreover,  and 
possibly  most  significantly,  its  variational  charac¬ 
ter  admits  realistic  geometry  optimization.  Final¬ 
ly,  with  the  inclusion  of  proper  basis  functions, 
the  method  gives  excellent  charge  distributions, 
with  which  many  properties  can  be  obtained. 

Yet,  the  HF  method  has  been  discarded  by  many 
band  theorists  because  of  its  “unphysical”  aspects. 
They  often  refer  to  the  vanishing  DOS  at  the  Fermi 
level  in  the  electron  gas  and  the  enormous  widen¬ 
ing  of  the  occupied  band.  Indeed,  even  the  valence 
and  conduction  bands  of  alkali  halides  and  diamond 
are  substantially  wider  than  those  inferred  from 
experiments.4'* • 10-  12  In  some  instances  correla¬ 
tion  corrections  have  been  made,  and  invariably 
the  bandwidths  become  more  “realistic."2,  *• 10  •  “ 
Whether  the  DOS  vanishes  at  the  Fermi  level  in 
crystals  with  partially  filled  bands  had  not  been 
clearly  established  so  far.  Jennison*  and  FBB“ 
discuss  this  question,  but  admit  that  it  would  be 
difficult  to  see  in  a  DOS  calculation  with  the  con¬ 
ventional  methods. 

We  wish  to  present  a  rigorous  proof  that,  in¬ 
deed,  the  DOS  at  the  Fermi  level  vanishes  rigor¬ 
ously  whenever  partially  filled  bands  occur,  re¬ 
gardless  of  the  shape  and  the  connectivity  of  the 
Fermi  surface.  We  will  also  show  that  this  is  the 
case  for  extended  systems  periodic  in  one  or  two 
dimensions,  such  as  polymers  and  thin  films.  No 
special  features  are  expected  in  the  DOS  for  filled 
and  empty  bands,  except  for  widening  of  bands 
and  gaps  near  the  Fermi  level.  Finally,  we  raise 
the  warning  that  unphysical  features  might  occur 
in  the  DOS  as  conventionally  calculated.  These 
are  related  to  the  sudden  drop  to  zero  of  the  DOS 
at  the  top  of  the  Fermi  sea. 

The  above  behavior  of  the  DOS  is  associated 
with  the  extensiveness  of  the  systems.  Therefore, 
it  is  not  surprising  that  it  can  be  most  elegantly 
shown  with  the  MS  approach.  In  Sec.  H  we  sum¬ 
marize  the  basic  formulas  of  crystal  HF  in  the 
MS  representation.  This  enables  us  to  most  easily 
discuss  the  analytic  behavior  of  the  band  energies 
(Sec.  EH).  Then  follows  the  proof  that  the  DOS 
vanishes  for  partially  filled  band  systems  (Secs. 

IV  and  V).  A  discussion  of  its  consequences  can 
be  found  in  Sec.  VI. 

II.  HARTREEFOCK  IN  MOMENTUM  REPRESENTATION 

Consider  a  crystalline  solid  with  A  atoms  per 
unit  cell,  with  nuclear  charges  Zu  Z„  ...  ,ZA 
and  position  vectors  su  s„ . . . ,  sA  relative  to  the 
unit-cell  origin.  The  unit  cell  volume  is  </„.  The 
lattice  vectors  are  indicated  by  3,  S',  etc.,  and 
the  associated  reciprocal  lattice  vectors  are  de¬ 
noted  R,  t?',  etc. 


We  assume  that  the  HF  Bloch  wave  functions 
|vk)  can  be  expressed  in  the  LCAO  form 

I ‘*>='£  lf*>c»Ak) .  U) 

» 

where  the  basis  Bloch  orbitals  j pk)  in  r  repre¬ 
sentation  are  given  by 

(rlpliT)*^'2^  expltic- R)<? - S i p)  (2) 

R 

as  a  real  lattice  summation,  or 

(r  jpic)  =  — — 22  exp[i(K  +  k)‘ r](k-  fv|p>  (3) 

V°  K 

as  a  reciprocal  lattice  sum.  .V  is  the  number  of 
unit  cells,  and  k  is  the  Bloch  vector  m  the  Bril- 
Louin  zone  (BZ).  Ip)  are  the  atomic  orbitals. 

The  |  i/k>  are  assumed  to  satisfy  the  orthonor¬ 
mality  condition 

(pk  |  i/k  >=  5^. ,  (4) 

or,  using  Eq.  (1)  and  the  identification 

5M(f)*<pk|«7k>,  (5) 

we  can  write 

'£iCmJ.i)SjZ)CJ$)-ia¥.  (6) 

H 

For  later  developments  it  is  convenient  to  write 
.Vf(k)  for  a  matrix  with  elements  A/M(k ),  and  C,(k ) 
for  a  vector  with  components  C„(k ).  We  can  now 
compactly  write,  instead  of  Eq.  (6), 

C;(k)3Jk)C,(k).5^.  (7) 

We  will  need  the  density  matrix  £(k),  with  ele¬ 
ments 

»,(^>Cyk)C^f).  (8) 


The  occupation  number  n„(k)  is  defined  as 


n„(k)» 


1,  if  EJ&)*Er, 

0,  if  E.(Vt)>£f, 


(9) 


where  £„(£)  and  Er  are  the  HF  band  energies  and 
Fermi  energy  as  calculated  below.  Adopting  a  re¬ 
stricted  HF  scheme  (double  occupancy  of  the  or¬ 
bitals),  we  have  the  charge  neutrality  condition 


JJ  22  D..t*X/*>'*(22  *.).  uo) 


where  the  k  sum  is  over  the  BZ.  Using  the  equiva¬ 
lence.  as  .V-  «, 
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we  can  equivalently  write,  thanks  to  (7)  and  (8), 

*  m»l 

The  HF  equations  can  now  be  cast  in  the  form 
[F(k)-Ev(k)S(k)]C.fk)-0.  (13) 

As  usual,  these  equations  are  obtained  by  minimiz¬ 
ing  the  HF  energy  with  constraints  of  Eqs.  (7)  and 
(10)  [or  (12)].  Indeed,  the  latter  condition  im¬ 
plicitly  defines  the  Fermi  energy  Er  The  Fock 
matrix  has  the  following  structure: 


*■*(£)■</*  !#!?£>; 

(14) 

F=  T+  V*C*X. 

(15) 

MONKHORST 

</>k  jF|(?k>»-iTT'  '  </>jR-it)F(if-R')<R'+k|a>; 

vo  -- 
K.K' 

(19) 

with  the  definitions 

l/(Q)  =  _i£z(3);  (20) 

z(5)=  ^  2mexpu3-sJ:  (21) 

m*  1 

(/>ir]c|,k)=iT^'</>iK^>«R-R')<K'-k!?> 

K.ii’ 

(22) 


f,  V,  C,  and  X  are  the  kinetic  energy,  nuclear  at¬ 
traction,  electron-electron  repulsion,  and  ex¬ 
change  operators,  respectively. 

We  are  now  ready  to  discuss  the  expression  for 
the  various  matrix  elements  over  the  operators 
in  /.  As  we  indicated,  we  introduced  the  momen¬ 
tum  representation  for  all  integrals.  This  was 
followed  by  an  interchange  of  the  integrations  and 
lattice  sums,  wherever  admissible.  Subsequently, 
use  was  made  of  the  lattice  orthogonality  rela¬ 
tion;  as  ,V  — 

V  exp(i$-I5)  =  —  £  <XQ-R).  (16) 

-  u0 


with  the  definition 

{dlk'ZDr.^ 

r» 


Finally, 

A„(k).<pk  ii|,k) 


*  (R'rk'|s>j  .  (23) 


(p|R  +  k>X(R+k, K'-k)(R'*k  |ij), 


V.-- 

K,  K 


For  V  and  C,  singularities  arise  for  K»0,  due  to 
the  long-range  nature  of  the  Coulomb  potential.  A 
careful  analysis  reveals  that  these  singularities 
rigorously  cancel,  provided  that,  besides  charge 
neutrality,  the  unit  cells  have  no  dipole,  quadru¬ 
ple,  or  second  moments.17  (This  condition  is 
generally  overlooked  in  conventional  applications 
of  Ewald-type  lattice  summations,  and  its  non¬ 
satisfaction  can  cause  shifts  in  the  band  energies 
and  total  energies  of  the  CS  approach.1*)  The  ex¬ 
pressions  below  are  therefore  the  remainders 
after  this  long-range  cancellation.  These  left¬ 
overs  can  be  interpreted  as  resulting  from  a  de¬ 
viation  from  a  uniform-charge  distribution. 

Casting  the  formulas  in  forms  convenient  for 
further  discussion  we  have 

<p|R»ii><K*k  !?>;  (l7) 

L0  - 

K 

</>k  ;f|,iT>.i£  (PlK-k)iii^<R*ki7); 

Vq  -  * 

K 

(18) 


(24) 

where  we  have  used  the  definition 
-V(5.3')« -(jpp  f  <#V  £  Dr, Ik') 

rl 

x  (ir  <rlK-’-5-5>iO(K»-k'ls)\ 

K- 

1 25) 

The  primes  to  the  R.K'  sums  of  Eqs.  (19)  and  (21) 
indicate  the  exclusion  of  K»i?'.  It  is  understood 
that  5  -5'  is  a  reciprocal  lattice  vector.  Notice 
that  the  nonlocal  character  of  X  is  reflected  by  the 
fact  that  X(<5,3')  is  not  a  function  of  $  -5'  alone. 

At  this  point  it  is  well  to  remind  the  reader  that 
Eqs.  (17)-(24)  are  not  always  most  convenient  in 
actual  calculations.  We  have  found  various  alter¬ 
native  expressions  more  conducive  to  exploitation 
of  convergence  characteristics.11117  In  particular, 
when  />)  and  or  :  <?'  are  corelike  orbitals,  direct- 
lattice  sums  give  more  rapid  convergence.  In 
practice,  therefore,  we  have  used  a  mixture  of 
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direct  and  reciprocal  lattice  sums.11 

The  HF  total  energy  per  unit  cell  £Hr  can  be  ob¬ 
tained  from  the  expression 

Mr 


*  L  [T.,(k)  +  V„(k)]]o>t(k)*C. 

M 

(26) 


T  and  Fare  given  by  Eqs.  (.18) — (20) ,  and  C  is  a 
constant  characteristic  of  the  lattice  structure 
and  cell  size 


The  sum  in  C  excludes  K=Q,  and  the  integral  is 
over  all  momentum  values.  This  term  appears 
when  the  singularity  from  the  V  contribution  to 
(26)  is  balanced  against  that  arising  from  the  nu¬ 
clear-nuclear  repulsion.17  Of  course,  the  same 
condition  applied  to  this  cancellation  as  discussed 
below  Eq.  (16);  the  singularities  have  the  same 
origin.  The  constant  C  can  be  interpreted  as  the 
electrostatic  energy  between  the  nuclei  with 
charges  and  positions  s„  within  the  unit  cell 
and  a  uniform  compensating  background  of  nega¬ 
tive  charge.  It  is  easy  to  show  that  C  vanishes  if, 
in  addition,  the  (positive)  nuclear  charge  is  spread 
out  uniformly.  This  is  the  situation  in  the  elec¬ 
tron-gas  model  for  a  solid. 

From  Eqs.  (17) -(25)  it  should  be  clear  that  the 
MS  formulation  is  really  advantageous  for  valence 
and  conduction  bands:  the  i?,K'  sums  will  converge 
fast.  A  few  such  expressions  can  be  found  in  FBB. 

In  conventional  band  calculations  C  has  been 
usually  ignored.  This  is  unfortunate  because  it 
is  easy  to  compute  for  any  lattice  structure,17 
and  its  inclusion  allows  a  realistic  total-energy 
evaluation. 

Finally,  we  wish  to  point  out  that  all  K  sums  are 
accompanied  by  a  factor  l,  vn.  This  will  prove  im¬ 
portant  when  discussing  extended  systems  with 
lower  dimensionality  (see  Sec.  V). 


HI.  AN  EXPRESSION  FOR  ?£„ I  i  I 

For  our  analysis  of  the  HF  DOS  we  need  the 
gradient  of  the  band  energies.  Starting  from  (13) 
and  dropping  the  k  dependence  from  the  expres¬ 
sion,  we  get 

(VE-£„VS)£,+  (F-EUS)$C,-  ?£,££,»  0.  128) 

When  we  now  premultiply  with  £j;  using  the  herm- 
iticity  of  F,S  and  the  orthonormahty  condition  of 


Eq.  (7),  we  readily  arrive  at 

v£„(k)  =  £jiT)[V£(k)  -£v(k)?S(k)]£„(k).  (29) 

This  is  a  very  compact  expression  indeed.  It 
shows  that  no  explicit  variation  of  C„  needs  to  be 
considered;  only  variation  of  matrix  elements 
matters,  once  the  HF  equations  are  satisfied. 


IV.  HF  DENSITY  OF  STATES 


The  density  of  states  (DOS),  p(£)  can  be  ob¬ 
tained  using  a  variety  of  mathematically  equiva¬ 
lent,  but  computationally  distinct  formulas.  For 
our  purposes  the  expression 


t‘o  f  m*  d S 
^  ,  •  *  IV  £„(?,)  I 


is  convenient.  The  integral  is  over  equienergetic 
surfaces  S,  with  position  vectors  kJ(  so  that 

£„(k,)*£  (31) 

for  at  least  one  v  value,  ii,  is  the  unit  vector  nor¬ 
mal  to  S  in  kt.  pt£)  is  normalized  so  that 

j^pLEldExf^  Zn,  (32) 


with  £,.the  Fermi  energy. 

It  is  well  known  that  in  the  electron-gas  model 
p(£r)  =  0,  i.e.,  the  DOS  vanishes  at  the  Fermi  sur¬ 
face  Sr  This  is  caused  by  a  logarithmic  singu¬ 
larity  in  V£(k)  at  the  (spherical)  Sr  (see,  for 
example.  Ref.  26): 

;v£(k,)|  ~ln,fe,-*,|  as  S-S,.  (33) 

We  now  wish  to  show  that  a  similar  singularity 
occurs  in  any  crystal  with  partially  filled  bands. 

Our  starting  point  is  Eq.  (29).  Going  over  Eqs. 
(14)  and  ( 17)— < 25)  we  notice  that  the  k  dependence 
appears  either  m  the  basis  orbital  Fourier  trans¬ 
forms  or  in  the  kernels  of  the  sums  for  the  T  and 
X  matrix  elements.  Typically  Gaussian  or  Slater- 
type  basis  orbitals  have  been  used;  neither  have 
Fourier  transforms  with  singular  gradients.  In 
fact,  quantum-mechanical  continuity  conditions 
mandate  the  k  dependence  of  Bloch  functions  to  be 
"smooth”  analytically.  Therefore,  basis  functions 
with  discontinuous  k  behavior  are  to  be  excluded. 
Moreover,  the  factor  (S-k)1  in  Eq.  18)  is  ob¬ 
viously  analytic  in  k.  We  thus  conclude  that  the 
only  possible  source  for  a  singularity  in  V£^k)  is 
the  "exchange  kernel”  of  Eq.  i25i. 

Let  us  look  at  this  quantity  more  closely.  When¬ 
ever 


K'*3»k. 


the  k'  integrals  are  of  the  form 
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/ 


Ik.  -  k'  1 2 


with  /( k')  analytic,  and  k  and  k'  both  within  the 
BZ.  But  now  we  have  identified  a  possible  singu¬ 
larity,  since  the  integrand  diverges  whenever  it' 

*  it;  the  analyticity  in  it  after  k'  integration  need 
not  be  maintained.  Therefore,  we  can  restrict 
our  attention  to  A'(it),  and  write 

£’(10  =  £’(k  )A(it  )C,fk ) .  (34) 


In  order  to  more  clearly  exhibit  the  underlying 
analytic  structure  of  El,  we  recognize  that,  in 
general,  the  density  matrix  contains  a  sum  over 
both  filled  and  partially  filled  bands.  Therefore, 
with  regard  to  the  iT'  integration  domains,  we  can 
write 


£'(k 


'-/  cPk'Z1 

•' a  t  n 


UgJTvg) 


L 


-  IK+k-k'l 

K 


j  iK-t-it -  it'  iJ 


(35) 


The  details  of  Y\  and  KJ  are  irrelevant  for  our  dis¬ 
cussion  except  that  these  quantities  (i)  are  analytic 
in  2  and  it',  and  (ii)  cause  periodicity  in  k,it'  space 
to  the  integrands.  Vr  is  the  volume  within  Sr 
Both  k'  integrations  are  to  be  confined  to  (at  most) 
the  first  BZ.  The  first  integral  (the  filled  band 
contribution)  is  over  this  BZ,  including  its  bound¬ 
ary.  The  second  integral  (the  partially  filled-band 
contribution)  is  over  a  fraction  thereof,  Vr,  in¬ 
cluding  the  boundary,  the  Fermi  surface  Sr 
As  we  indicated,  nonanalyticity  in  k  can  occur 
from  K  =  0  in  Eq.  (35).  We  therefore  single  out 
those  terms  and  Taylor-expand  y‘, 

r; Kit,  it',  oi  =  K;(ic,  k,  oi-  (k'  -it) 

•  (v'ri(k,k',0)j.  (t  =  1,2). 

(36) 


Substituting  these  expressions  in  (35)  we  generate 
k'  integrals  of  the  form 


UW-I.  f 

■  y  Ik  -k  I 


(37) 


(k'  -k)dV 
it-  k'l2  ’ 


(38) 


where  V  is  either  BZ  or  V r  We  have  now  iso¬ 
lated  the  quantities  that  might  be  responsible  for 
singularities. 


V  INTEGRALS/,  AND  i, 


Let  us  first  consider  We  wish  to  apply 
Gauss’s  theorem.  If  k  is  within  V  we  must  ex¬ 
clude  a  small  sphere,  with  volume  v,  radius  e, 


and  surface  o,  around  it  so  as  to  have  continuity 
over(F-v).  Writing 


1 

lit -it' i2 


3  O'- 


(ic'-k) 

Ik -it' l2’ 


(39) 


we  express  in  the  form 

/0(F,k)3/0(K-f,k)*f0(f,k) ,  (40) 


or,  using  Gauss's  theorem  for  the  first  term, 


/„(  V  -  c. 


(k,-k)-dS  pk.-itl-rfj 
i  k ,  -  k  i 2  K  I  it  s  -  it  1 2 


(41) 

Z  is  the  boundary  to  V  (being  either  the  BZ  bound¬ 
ary  or  Sr).  ka  are  the  o  surface  position  vectors. 
Evaluating  the  integrals  associated  with  v  and  <r. 


*  4ire  .  (42) 

/"(it,  -  it)-  do  1  f,  , 

— ~ - r — do*  —4 n €  .  (43) 

ik,-k  I2  tJ. 

Combinmg  (40)-(43)  we  thus  conclude 


fc(  v,  k )  * 


f  (k.-k)-dS 
■e  l  k ,  - k i 2 


(44) 


As  €  can  be  made  arbitrarily  small,  this  result 
also  holds  for  it  on  Z. 

For  I,,  we  can  carry  through  a  similar  analysis. 
The  result  is 


MF.it)*  I  In  jit,  -  it  |<fS  ;  (45) 

•  z 

as  before,  this  holds  lor  all  it. 

We  are  now  ready  to  discuss  the  gradient  of  /„ 
and  /,.  Whenever  it  is  not  on  E,  i.e. ,  kis  either 
within  or  outside  V,  the  integrands  are  finite  and 
continuous  m  k.  We  therefore  can  bring  the  grad¬ 
ient  operation  under  the  integration.  For  example, 


v/0;r,k,= 

-s  lk.-kl2 


or,  using 
q=k,-k, 
we  can  write 


'  q-iq-  JS ) 


V/0iF,  k)*2  I 

1  z  7 


f  £ 

-z  7;  ’ 


(46) 


(47) 


Clearly  q  never  vanishes  in  this  case,  and  we  find 
r/ol  F.ioj  <«  ,  »kcT  148) 


Similarly,  it  is  easily  found 

?/,!  V,  ic )  |  ,  ik£-.  (49) 


We  thus  reach  our  first  conclusion:  (i)  The  band- 
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energy  gradient  CE^k)  will  have  finite  values  for 
all  k  not  on  either  the  BZ  boundary  or  on  the  Fer¬ 
mi  surface  Sf. 

Now  we  will  consider  the  case  that  k  approaches 
a  vector  k,  on  Z.  Because  of  a  lack  of  continuity  of 
the  integrands  in  the  surface  integrals  (44),  (45) 
we  cannot  bring  the  gradient  operation  under  the 
integral  signs.  Therefore  we  must  proceed  as 
follows: 

We  can  assume,  without  loss  of  generality,  that 
k  approaches  k,  so  that 


J-k-it, 

(50) 

is  an  infinitesimal  vector  parallel  to 
vector  normal  to  Z  in  k,, 

the  unit 

T/6=  raa. 

(51) 

6  is  the  distance  of  k  to  E.  The  minus  (plus)  sign 
applies  when  it  approaches  it,  from  within  (out¬ 
side).  We  next  cast  a  small  circle  denoted  by  c, 
around  k,  on  S.  Its  radius  <  is  taken  small  enough 
so  that  Z  can  be  considered  locally  flat.  We  then 
can  express  f„,  It  as 

u  v,  it)  *  /•(  v.iT)*  [/„(  V,  k )  - i;(  V,  k )  J 

(m  =  0, 1) , 

(52) 

where 

(53) 

Iff  V, ic)=  f  ln|k,-k|do. 

(54) 

Obviouslyithe  term  in  square  brackets  in  Eq.  (52) 
excludes  k,  from  the  surface  integral.  There¬ 
fore,  its  gradient  with  respect  to  k,  as  k-k,,  will 
remain  finite.  However,  this  is  not  the  case  for 
the  other  term  when  m*  0,  as  we  will  now  show. 
We  introduce  the  variable  q, 

q=it, -k.  (55) 

If  we  describe  the  integral  over  c  with  the  radial 
variable  r,  then  we  have 

q"d5=  6rdrdd>  ,  156) 

<7a.r,*5J.  (57) 

We  thus  are  led  to 

-  ru  r  rdr 
/sc v, k ) > a  /o  T. 

or 

/5(V,k)*ir[6ln(62*<2)  -  26  ln«]  .  158) 

Since  5*  ik-k,|,  we  readily  derive 

9/5^,k)-a2jn,  [ln«*^jI-iln(a,*<J)]  .  (59) 


Finally,  we  are  prepared  to  take  the  limit  k-k,. 
Doing  so,  Eqs.  (58)  and  (59)  immediately  give 
(with  «>0), 

lim V, it) » 0  ,  (60) 

UmV/gV.it)**  nm2rn1lnjk-k,j  .  (61) 

»-«»  i-i, 

Now,  considering  If,  we  quickly  get 

/ft  V,  it )  *  |  Jin  Ik, -fide  |  . 

or,  with  Eqs.  (35) — (57)  , 

If(  V,  k )  «  e  |  ( t2  *  6s)  ln(«2  *  62)  _  <2  -  262  lnfl  |  . 

(62) 

Furthermore, 


or,  using  (62), 

jv/f(V,k)|  s  2r | 25  ln5  -  6  ln(  <2  +  82)  |  .  (63) 

Again,  we  take  the  limit  ic-ict,  and  we  obtain 

Um/f(V,it)s!re2]21nc-l|  <«,  (64) 

lim  1  V/f(  V, k ) |  «0.  (65) 

»-*» 

Looking  at  Eqs.  (61),  (65),  and  remembering  the 
role  Icm  plays  in  /„  [Eq.  (52)  j  we  conclude 

lim  /,(  V, it) < "  ;  (66) 

lim  !,(  V,k)<»  ;  (67) 

‘-‘i 

lim  j  V/0(  V',  k  )  |  =  lira  |  2r  ln6|  ;  (68) 

**° 

lim  |v/,(V',k)j<«>.  (69) 


A.  Singularity  in  i  k ) 

Returning  to  the  Taylor  expansion  (36),  which 
gave  rise  to  /„  and  /,,  it  is  clear  that  higher -order 
forms  will  not  generate  any  divergences  in  ?£J( k); 
the  k'  integrands  will  not  be  singular.  Therefore, 
we  have  precisely  pinpointed  the  singularity  in 
VE„( k),  and  we  will  argue  that  it  only  occurs  m 
partially  filled  band  systems,  with  k  any  point  kr 
on  the  Fermi  surface. 

In  order  to  see  this,  we  first  remind  ourselves 
that  the  singularity  in  ?f0  emerges  as  a  three- 
dimensional  version  of  an  endpoint  singularity  to 
an  integral  representation  with  singular  kernel. 

We  saw  that  no  stngular  behavior  was  found  for  k 
away  from  the  "endpoint”,  i.e.,  not  on  Z. 
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Now,  using  the  symmetry 

£'*£')  no) 

in  Eq,  (25),  it  is  not  hard  to  see  that  the  first  term 
in  Eq.  (35)  is  of  the  form 


dv-lii f 

‘K'-t-  K'  -k  I3  J 


d3ic 


There  is  no  endpoint  singularity,  as  this  integral 
covers  the  entire  reciprocal  lattice.  Butjthat 
means  that  th  e  full  BZ  integration  _in  £J  it)  does 
not  give  a  singular  contribution  to  vEJ,k),  what¬ 
ever  k  is. 

Indeed,  we  are  now  in  a  position  to  make  a  pre¬ 
cise  statement  concerning  v£„(k): 


Urn  |v£„(k)|=>Jim  2ir|r3kr,kr,0)ln|k-kf||. 

i-i,  i*V 

(71) 


We  have  now  arrived  at  the  second  and  main  re¬ 
sult  of  this  section:  (ii)  In  partially  filled  band 
systems  the  gradients  of  all  band  energies  £„( it) 
diverge  logarithmically  whenever  k  is  on  the  Fer¬ 
mi  surface  SF. 

In  filled-band  systems  these  gradients  never  di¬ 
verge. 


B.  DOS  at  Fermi  level 

It  is  now  immediately  obvious  that,  indeed,  the 
DOS  vanishes  at  the  Fermi  energy  EF :  the  sur¬ 
face  S  becomes  Sr  for  £,,  The  integral  in  Eq. 

(30)  is  over  a  zero  integrand,  thus  giving  no  con¬ 
tribution  to  p(£r).  Obviously  this  result  holds 
whether  S,  is  either  multiply  connected  or  entirely 
within  the  BZ;  the  cause  of  the  singularity  is  the 
termination  of  the  k'  integration  short  of  the  full 
BZ. 

Our  result  of  Sec.  IV  A  has  an  interesting  im¬ 
plication.  U,  for  a  particular  E*Er,  the  equien- 
ergetic  surface  S  coincides  with  Sr,  then  the  as¬ 
sociated  (HE)  will  vanish  as  well.  We  are  unaware 
of  a  general  principle  that  precludes  this  to  hap¬ 
pen.  Although  it  seems  highly  unprobable,  we  have 
to  be  concerned  about  this  possibility.  We  will  re¬ 
turn  to  this  point  in  Sec.  VI. 


VI.  DOS  FOR  SYSTEMS  EXTENDED  IN  FEWER 
DIMENSIONS 


plies  to  these  lower -dimensional,  extended  sys¬ 
tems.  After  all,  one  can  imagine  these  systems 
to  physically  appear  if  one  allows  one  or  two 
unit  cell  dimensions  of  a  bulk  crystal  to  increase 
beyond  limits,  keeping  all  relative  position  vec¬ 
tors  s„  finite  in  length.  Periodicity  in  these  (in¬ 
finite)  dimensions  then  becomes  immaterial,  and 
we  have  effectively  a  collection  of  noninteracting 
extended  systems  of  lower  dimensionality. 

Theoretically  we  can  answer  this  question  in 
two  different  ways.  Using  the  MS  approach,  we 
can  formulate  the  HF  problem  for  polymers1* 
and  thin  films21  as  isolated  systems,  exploiting 
the  one-  and  two-dimensional  periodicities.  The 
Fermi  surfaces  are  two  points  in  polymers,  and 
a  planar  curve  in  thin  films.  Instead  of  the  three- 
dimensional  reciprocal  lattice  sums,  double  re¬ 
ciprocal-space  integrals  and  one  summation  ap¬ 
pear  for  polymers  (two  summations  and  one  in¬ 
tegral  for  thin  films).  An  analysis  similar  to  the 
one  given  here  can  then  be  carried  through. 

However,  a  physically  more  appealing,  albeit 
mathematically  somewhat  less  rigorous  approach 
is  to  start  from  the  crystal  HF  formalism  of  Secs. 
II-rV,  and  take  certain  limits  so  as  to  describe 
the  reduction  of  dimensionality  of  the  system. 
These  limits  should  be  associated  with  the  stretch¬ 
ing  of  unit  cell  sizes  in  one  (two)  directions  to  ob¬ 
tain  the  formulas  for  thin  films  (polymers)  in  the 
HF  description. 

First,  let  us  consider  the  transition  to  thin-film 
systems.  Thereto,  without  loss  of  generality,  we 
assume  a  monoclinic,  three-dimensional  unit  cell 
with  a  paralleilogram  basis  with  area  a0  and  height 
h.  It  then  follows  that 

v„=a0h.  (72) 

The  lattice  and  reciprocal  lattice  vectors  are  ex¬ 
pressible  as 

£=5,,-^;  (73) 

K=K,-K1,  (74) 

or  as  a  sum  of  vectors  parallel  and  perpendicular 
to  the  plane  of  the  unit  cell  base.  The  perpendic¬ 
ular  vectors  are  given  by 

R  L*mhz,  m  =  0,  *1,  ±2, . . .  ,  (75) 

Kk  =  d2v/h)2  ,  n«0,  ±1,  ±2 .  (76) 


Polymers  and  thin  films  are  extended  systems 
(usually)  periodic  in  one  or  two  dimensions,  re¬ 
spectively.  HF  calculations  at  the  same  rigorous 
level  as  bulk  crystal  work  are  being  performed,1’ 
or  are  underway.”  In  view  of  our  finding  that  the 
HF  DOS  in  crystals  vanishes  at  the  Fermi  level 
the  question  can  be  raised  whether  this  also  ap¬ 


if  it  is  assumed  that  the  unit  cell  base  is  parallel 
to  the  x-y  plane. 

The  obvious  next  step  is  to  take  the  limit  !i-«, 
keeping  the  sm  finite.  In  our  formulas  (17)-(27) 
this  limit  has  two  consequences.  Noticing  that  a 
factor  (l.'u0)  ‘s  associated  with  each  sum.  the 
first  effect  is  that  the  K  sum  becomes  very  dense. 
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i.e.,  In  the  limit  A  —  “  this  becomes  an  integral. 
Indeed,  we  can  write 


Formulas  generated  in  this  manner  are  identical 
to  those  obtained  by  a  direct  application  of  the  MS 
approach  to  thin  films. 

The  other  consequence  is  that  k'  integrations 
reduce  to  integrals  in  the  two-dimensional  BZ, 
with  it' -  it',,.  More  specifically 

The  A'  integral  will  scale  like  2ir/A.  The  occupa¬ 
tion  numbers  n„(k)  (contained  in  the  integrandsjaf 
the  it  integrations)  will  become  independent  of  kL 
in  the  limit  A-  ■»  since  its  length  vanishes.  There¬ 
fore,  we  can  say  that  in  this  limit,  the  k[  integral 
"fills  up"  the  entire  BZ  width  (2-/A)  in  the  per¬ 
pendicular  direction.  We  thus  conclude  that 

Again,  the  k'  integrals  generated  are  those  ob¬ 
tained  in  a  "direct”  slab  approach;  as  h  -  ”,  all 
k'  vectors  approach  a  vector  k]',  in  the  two-dimen¬ 
sional  BZ. 

But  now  we  can  immediately  state  that  the  re¬ 
sults  of  the  previous  section  regarding  the  den¬ 
sity  of  states  apply  to  thin  films  as  well.  Volume 
integrals  become  surface  integrals,  and  surface 
integrals  become  line  integrals.  In  particular, 
the  divergence  in  |^£,,(k)i  ^or  Partially  filled 
bands  [Eq.  (68)  J  will  hold  as  well  in  the  limit 
A-«.  Since  the  divergence  found  is  related  to 
the  distance  of  k  to  the  Fermi  surface,  this  re¬ 
sult  should  be  no  surprise. 

By  subsequently  taking  the  limit  that  a  two-di¬ 
mensional  unit  cell  dimension  approaches  infinity, 
we  generate  the  formulas  for  polymers.  Mutatis 
mutandis,  we  arrive  at  the  same  conclusion, 
namely,  that  in  a  partially  filled  band  polymer 
the  HF  DOS  vanishes  at  the  Fermi  level.  No  van¬ 
ishing  p (£)  occurs  in  filled  band  polymers.  In 
this  case  the  Fermi  surface  consists  of  two  points. 

Lest  the  reader  is  worried  whether  the  outlined 
treatment  concerning  the  DOS  is  sufficiently  rig¬ 
orous.  he  can  convince  himself  by  carrying  out 
the  analysis  directly  on  isolated  thin  films  (poly¬ 
mers).  For  the  exchange  terms  the  integrals 


over  resulting  from  (IT)  have  to  be  treated 
carefully  around  XL  =  0.  Exclusion  of  a  small  re¬ 
gion  [-«,<],  with  £>0,  will  "extract"  a  small, 
pancake -shaped  volume  in  reciprocal  space 
around  the  origin,  which  captures  the  singularity. 
To  that  volume  the  same  analysis  as  before  can 
be  applied,  giving  the  same  results. 

To  sum  up  then,  we  have  arrived  at  the  follow¬ 
ing  conclusions: 

(i)  The  HF  band-energy  gradients  VEJ, k)  asso¬ 
ciated  with  systems,  extended  periodically  in  one 
or  more  dimensions  and  possessing  partially 
filled  bands,  diverges  logarithmically  as  it  ap¬ 
proaches  a  Fermi  surface  vector  kr  No  such 
divergence  occurs  for  filled  band  systems. 

(n)  The  HF  density  of  states  p(£)  for  such  ex¬ 
tended  systems  with  partially  filled  bands  van¬ 
ishes  at  the  Fermi  level  Er  At  any  energy  E  *Er 
for  which  the  equienergetic  surface  S  [Eq.  (31)] 
coincides  with  the  Fermi  surface  S„  p(£)  will 
also  vanish. 

In  general,  p(£)  will  not  vanish  for  filled  band 
systems  with  £  not  in  band  gaps. 

Vfl.  DISCUSSION 

The  results  of  this  paper  concerning  the  HF  DOS 
have  a  number  of  interesting  and,  in  some  cases, 
somewhat  disturbing  consequences  for  calcula¬ 
tions  already  published  or  contemplated. 

With  regard  to  crystal  HF  work,  in  a  number  of 
cases  HF  DOS  are  presented  that  ought  to  show  a 
vanishing  p(EF ).  For  example,  the  DOS  for  cal¬ 
cium,28  lithium,’  and  TiO,10  show  a  conspicious 
absence  of  a  zero  HF  DOS  at  Er  In  view  of  the 
computational  methods  used  (polynomial  interpo¬ 
lation,  with  or  without  Gaussian  broadening,  or 
a  Gauss-Chebyshev  method,  formulated  by  Monk- 
horst  and  Pack23)  it  is  impossible  to  see  this  be¬ 
havior.  Moreover,  the  above  calculations  have 
been  performed  in  the  CS  approach.  The  trunca¬ 
tion  of  the  direct  lattice  sums  (forced  upon  for 
practical  reasons)  causes  the  p(£,)  to  not  even 
vanish  rigorously.  Yet,  the  intrinsic  extensive¬ 
ness  of  the  systems  considered  requires,  as  we 
saw,  the  HF  DOS  to  vanish.  We  therefore  con¬ 
clude  that  these  DOS  are  qualitatively  in  error, 
at  least  near  Er,  and  cannot  possibly  be  repre¬ 
sentative  of  the  HF  DOS  near  the  Fermi  level. 

We  also  found  that  for  an  energy  whose  equien¬ 
ergetic  surface  S  is  identical  to  the  Fermi  sur¬ 
face  Sr,  the  HF  DOS  should  vanish.  The  occur¬ 
rence  of  this  seems  highly  improbable  and  might 
even  be  impossible.  Certainly,  viewing  the  band 
structure  as  to  arise  from  a  perturbed,  nearly  - 
free-electron  (NFE)  model  one  seems  to  have  to 
rule  out  such  coincidence.  However,  two  remarks 
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are  in  order  in  this  connection.  First,  the  NFE 
model  is  inappropriate  (or  crystals  with  strong 
Bragg  scattering,  such  as  highly  localized  or¬ 
bitals  of  d  and  /  character.  Fermi  surfaces  in 
transition  element  compounds  are  notoriously 
complicated,  and  the  same  can  be  expected  of 
above  surfaces  S.  Second,  even  though  the  co¬ 
alescence  of  surfaces  S  and  Sr  seems  improbable, 
they  could  be  close.  If  that  happens  chances  are 
that  unphysical  shoulders,  peaks,  or  near  gaps 
could  appear  in  p (£).  The  reason  for  this  can 
be  found  in  FBB  [Ref.  14],  In  Fig.  3  of  their  sec¬ 
ond  paper  the  authors  present  the  HF  DOS  for  the 
NFE  gas,  both  the  exact  curve  and  numerical  val¬ 
ues.  Apart  from  the  inability  of  the  numerical 
method  to  accurately  reproduce  p(£)  near  £  n 
another  unsettling  observation  can  be  made:  p (£) 
peaks  at  an  energy  about  (.  f )  the  band  width.  This 
is  to  be  contrasted  with  the  steadily  increasing 
Hartree  p(£),  which  on  many  counts  is  closer  to 
the  “correct"  DOS.  The  kinetic  energy  decreases 
faster  with  decreasing  electron  density  than  the 
exchange  energy.  Therefore,  we  expect  this  pre- 
peaking  to  be  even  more  pronounced  at  lower  elec¬ 
tron  densities.  But  now  we  have  to  conclude  that 
unphysical  features  to  the  p(£)  curve  can  already 
occur  for  energies  E  with  equienergetic  surfaces 
5  quite  different  from  Sr  We  saw  a  pre-peaking 
in  the  NFE  gas  model.  We  thus  can  expect  shoul¬ 
ders,  peaks,  or  even  near  gaps  to  occur,  depend¬ 
ing  on  how  closely  S  approaches  Sr  and  what  the 
electron  density  is. 

All  correlation  corrections  have  concentrated  on 
the  Fermi  level  DOS  or  the  band  width  or  band 
gaps.  Our  results  indicate  that  with  complicated 
band  structures  such  as  for  transition  element 
compounds,  considerable  attention  has  to  be  paid 
to  the  correlation  problem  at  all  energies.  The 
fact  that  numerical  inaccuracies  tend  to  wash  out 
these  HF-caused  singularities  (or  near  singu¬ 
larities)  is  irrelevant,  although  expedient  in  prac¬ 
tice.  The  point  is  that  the  theory  allows  for  them, 
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and  therefore  they  should  be  either  shown  faith¬ 
fully  or  corrected  for.  Unfortunately,  so  far  no 
scheme  has  been  formulated  that  is  practical  and 
theoretically  sound. 

It  is  even  more  likely  that  thin  films  and  polymer 
HF  DOS  exhibit  such  unphysical  features.  In  the 
case  of  thin  films  it  is  quite  possible  that  two 
Fermi  curves  come  close  in  shape  and  area.  Since 
in  polymers  the  Fermi  surface  consists  of  two 
points,  there  is  an  infinite  number  of  energies 
(lying  within  bands)  for  which  “S  coincides  with 
Sr"  In  practice,  however,  this  is  not  very  rele¬ 
vant;  stable,  partially  filled  band  polymers  do  not 
occur,  although  a  filamentary  structure  of  equally 
spaced  polymers  of  hydrogen  atoms  has  been  pro¬ 
posed  and  studied  for  metallic  hydrogen.15 

Summing  up  then,  one  has  to  be  quite  cautious 
when  interpreting  detailed  features  to  p(£)  at  the 
HF  level,  particularly  for  complicated  crystals 
on  thin  films  with  partially  filled  bands.5'10  By 
analogy  with  the  widening  of  the  HF  NFE  gas  band 
width,  we  expect  considerable  exaggeration  of 
bands  and  gaps  near  £,.  No  vanishing  p(£)  .within 
bands  of  filled-band  systems  is  expected. 

Notwithstanding  these  fundamental  failures 'to 
the  crystal  HF  results  we  do  not  advocate  a  local 
approximation  to  exchange  and  correlation  cor¬ 
rections.  The  work  of  Overhauser  and  others5* 
has  suggested  that  a  considerable  nonfocality  has 
still  to  be  associated  with  these  corrections,  al¬ 
though  not  as  strong  as  "bare"  exchange.  The 
use  of  HF  as  a  first  approximation  has  many  vir¬ 
tues:  it  is  well  defined,  yields  variational  total 
energies,  and  gives  good  charge  densities.  Be¬ 
yond  that,  the  correlation  problem  for  extended, 
systems  is  still  with  us,  and  progress  has  been 
slow.25 
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Abstract 


Integrals  for  LCAO  calculations  on  thin  films,  with  or  without 
cverlayers,  have  been  rigorously  reduced  to  two-dimensional  reciprocal 
lattice  sums  and  an  infinite  integration.  Convergence  properties 
and  computational  effort  are  as  favorable  as  in  earlier  bulk  calcu¬ 
lations.  Hartree  and  Hartree-Fock  calculations  can  be  performed 
with  Slater  orbital  basis,  without  pseudo-potentials,  surface  clusters 
or  matching  planes.  The  method  should  be  ideal  to  study  surface 
states,  reconstruction  and  adsorption. 
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During  the  past  few  years  we  have  formulated-  and  implemented^ ’ J 

a  method  to  perform  ab  initio  Kartree-Fock  (HF)  calculations  on 

three-dimensional  (3D)  crystals.  The  essence  of  the  method  is  the 

reduction  of  all  crystal  integrals  over  Bloch  functions  to  oD  reciprocal 

lattice  summations  involving  Fourier  transforms  of  basis  atomic  orbitals 

only.  Madelung-type  conditional  convergence  is  no  problem,  as  the 

method  is  inherently  based  on  an  infinitely  sited  sample.  Exchange  was 

treated  exactly,  and  various  basis  functions  were  used,  such  as  Slaler- 

03 

type  orbitals  and  plane  waves.  Simple  metals  ’  and  molecular  hydrog<  n 
4 

crystal  were  considered  so  far.  Application  to  other  crystals, 

such  as  diamond  and  beryllium,  with  a  double-zeta  Slater  basis  including 

3d  orbitals  is  underway. 

I  wish  to  show  how  the  same  technique  can  be  used  to  obtain 
workable  expressions  for  the  exact  evaluation  of  integrals  in  the 
LCAO  method  wiien  applied  to  2D  crystals.  These  systems  comprise 
thin  films,  without  or  with  one  (or  more)  overlayers  of  foreign  material, 
and  with  long-range  2D  periodicity.  First  I  will  explain  the  essential 
features  by  treating  a  single  layer  of  atoms. 

Consider  N  hydrogen  atoms  occupying  the  lattice  sites  of  a  planar 
2D  lattice  that  is  characterized  by  the  lattice  vectors 


(i)  ^  +  "/a* 


and  are  primitive  vectors  in  the  2D  lattice  with  unit  area 
a„  =  h*,«hU  In  the  same  spirit  as  our  3D  crystal  work  I  express  the 

o  1  c. 

Schrodinger  hamiltonian  H  as  follows  in  atomic  units  (]4  =  m  =  e  =  1  '> : 


N 


N 


2)  H  =  Z  (-sVi)  +  S  h(ri>rj), 


i=l  -  i> j=l 

where,  as  N  approaches  infinity, 


(3)  h(rx,r2)  =  r^1  -  ^  S(  | ^_  -  +  |f2  - 

m 


m2 


n2 1 


A  2D  Bloch  function  based  on  a  single  orbital  at  each  lattice  site 
can  be  expressed  as 

iiu*  F?  0 

(4)  j  k2>  =  2  e  cd(  r  - 

m  '"‘~ 


k0  is  the  31cch  vector  in  the  2D  Brillouin  zone  associated  with  the 
*■  n  . 

2D  direct  lattice.  The  HF  expectation  value  of  the  Schrodmger 

haniltonian  of  Eq.(3)  now  involves  one-  and  two-electron  integrals 
over  2D  Bloch  orbitals  |kL)  that  I  will  reduce  to  computationally 
more  convenient  forms.  I  follow  the  same  procedure  as  in  our  earlier 
3D  work.  All  integrals  will  be  expressed  in  a  Fourier  representation. 
Then,  after  having  interchanged  lattice  summations  and  Fourier  argu¬ 
ment  integrations,  I  make  use  of  the  relation 


(5) 


,  — r  tr*  r 

„  /iq,R.n2  ...  (2 tt)* 


u  e 
m 


2  5(q^  -  $  ) 
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is  the  projection  of  q  onto  the  2D  reciprocal  lattice  with  vectors 
f?  .  Eq.(5)  holds  in  the  limit  of  N  approaching  infinity.  Let  me 
give  some  examples.  An  overlap  integral  is  reduced  as  follows 


(6)  <i?2|?2> 


N  £  e 
m 


<cp(D|cp(?  -  R^)) 


—  Jaq  cp^(  q)  cp  r( -q) 
(2n)3 


i(£,  -  q*).^ 

X  2  e 

m 


Application  of  Eq.(5)  leads  to 


<7>  <^l^>  -  Is;  jV.  *2  ^ 
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q^is  the  q  component  perpendicular  to  the  2D  reciprocal  lattice 

characterized  by  K^.  If  cp  is  a  Slater-type  orbital  the  q  ^ 

integration  can  be  performed  analytically.  Depending  on  the  2D 

crystal  structure  the  2D  sum  can  probably  be  performed  analytically. 

The  kinetic  energy  integrals  reduce  to  very  similar  exp-essions. 

Before  going  to  the  electrostatic  integrals  I  would  like  to 

make  some  remarks.  Notice  that,  due  to  a  lack  of  lattice  periodicity 

in  the  direction  perpendicular  to  the  layer  plane,  we  obtain  in 

Eq.(7)  an  infinite  integral  in  lieu  of  a  sum  encountered  in  the  3D 
12  3 

crystal  case  ’  ’  .  In  fact,  one  can  derive  all  2D  formulas  in  this 

paper  by  starting  from  the  earlier  3D  formulas  and  allowing  one  3D 

cell  dimension  (say  parallel  to  qj_ )  to  become  infinitely  large. 

Physically  this  generates  non-interacting  layers  of  atoms,  each  of 

which  is  described  by  a  Hamiltonian  like  Eq.(3)>  Furthermore,  from 

a  numerical  standpoint  Eq.(7)  is  not  harder  to  evaluate  than  the 

T 

earlier  3D  version.  Instead  of  evaluating  cp  at  3D  reciprocal  lattice 
points  and  then  forming  the  3D  sum,  we  have  to  compute  these  functions 
at  a  3D  grid  (q^,  )  with  qj^values  dictated  by  the  numerical  quadrature 

used.  Clearly  the  convergence  properties  will  be  essentially  the  same 
for  similar  types  of  atoms.  I  want  to  stress  that  convergence  for  all 
3D  core-valence  and  valence-valence  integrals  is  invariably  found  to 
be  very  favorable. 

I  will  now  turn  to  the  expressions  for  Coulomb  and  exchange  matrix 
elements.  For  those  integrals  the  power  of  the  Fourier  transform 
method  shows  again  most  impressively.  In  reducing  the  one-  and  two- 
electron  integrals  I  made  use  of  the  following  formuals. 

(3) — i —  =  7T  J  e'^ 

2tt  *  q 

(9)  [  dSr(?)|?  -  $\ _1  =  7-2  J' 

2rr  q 


-k- 


'  10)  Jd?1.i?2f(?1)(7i-)g(?2  -  ff)  ,  _L  j-  fT(  J)sT(-5Vi3',B’ 

12  2r  q 


l'  and  g  represent  lattice  sums  of  two-center  overlap  distributions. 
Skipping  details  it  should  now  be  easy  for  the  reader  to  verify  the 
following  expressions. 

-  1 

(11)  <k  it'|h|i?£’  >  =  JL  cp*(5Lo)«(C',o)  +  §2  r  do  z  (k  2+  q  2) 

^  ^  d  2jf  d  d  d  a0  X  l  12  *■ 


[5(i?2^i_+  ^12  )$  '  *2  *  ’  "^lT  ^2}  *  +  ^2* ,0^ 

-  $0?2,0)$(lc2,,-qL-  f?i2)  +  6i0*(^0)*(^a',0)] 


(12)  0=^-2  i 


2  a0  ^  Kt2  J  % 

Kl2fa 


(13)  »<£,,?)  =  i—  fdpA  t  ^  *  i?2)  /(?  -  i?  -  ^  -  f£i 


(1M  <EJ2’lr12'l|it,%>  -f2  J  dqAE 


*(£,,-?)*(  It*, Q*) 


where 


(15)  S' *  ?A+  j?<2  *  £>' 


"1  ^ 

By  comparing  Eqs.(ll)  to  (15)  with  earlier  expressions  for  3D  crystals 
one  will  notice  striking  similarities.  Just  as  I  indicated  for  the 
overlap  integrals  the  numerical  efforts  will  be  essentially  the  same. 

The  lattice  constant  is  the  2D  analog  of  our  earlier  constant  in 
Ref. 2(a);  it  is  the  potential  of  the  2D  lattice  of  unit  charges  at 
the  lattice  points  balanced  by  a  uniform  2D  sheet  of  compensating 
charge  in  the  plane  of  the  lattice.  We  already  encountered  this 
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quantity  in  the  3D  work  ,  where  we  developed  reliable  computational 
methods  for  its  evaluation. 

2qs.(7)  and  (12)  thru  (15)  exemplify  the  types  of  integrals  that 
will  occur  in  Hartree  or  HF  calculations  for  a  single  layer  of  hydro¬ 
gen  atoms.  One  can  formulate  a  self-consistent  calculation  by  descri¬ 
bing  the  2D  Bloch  function  as  a  linear  combination  of  basis  Bloch 
functions  like  Eq.(3)  with  appropriately  chosen  cp's  'the  LCAO  method). 
Clearly  the  crystal  integrals  over  the  different  basis  orbitals  can 
be  expressed  in  a  similar  fashion.  As  we  have  done  in  the  3D  case, 
the  expansion  coefficients  are  then  computed  self-consistently. 

Let  me  now  indicate  how  easily,  both  formally  and  practically, 
above  technique  can  be  applied  to  realistic  thin  films.  Consider 
a  2D  basis-Bloch  orbital  | k^p)  given  by 

(16)  (l^p)  =  2  e  ^  ^  <p  (r  -  ^  -  ?  ) 

m 


associated  with  the  p-th  atomic  orbital  at  relative  positions  s 

P 

within  each  2D  unit  cell.  In  general  s^  is  given  by 


(17) 


s  .  and  s  are  the  components  orthogonal  and  parallel  to  the  plane 
Pi-  P2 

of  2D  periodicity,  respectively.  As  before  actual  2D  Bloch  functions 
are  then  expressed  as  linear  combinations  of  |  k*np)  •  Integrals  over 
these  functions  are  now  very  simply  related  to  those  obtained  above. 
For  example,  an  overlap  integral  is  given  as 


(18)  (Vl^t)  -agr 


*  ( s*t 


-v 


J'  lq> 


-  s  ) 

Pi 


X 


Co*  ( 


2  e 


22  '  t2 


2 


*2 


T 

cpt  v 


li. 
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When  Slater  orbitals  are  used,  the  q^integration  can  be  performed 
analytically,  if  so  desired.  The  remaining  2D  reciprocal  lattice 
sum  is  then  found  to  be  exponentially  convergent.  The  most  remarkable 
aspect  is  the  great  similarity  between  Eqs.(7)  and  (18),  particularly 
from  a  numerical  point  of  view.  The  reader  should  be  able  to  convince 
himself  that  the  came  holds  for  all  other  integrals  above  when  evalu¬ 
ated  for  general  2D  Bloch  orbitals.  The  convergence  properties  will 
clearly  be  very  similar  as  well.  The  oscillatory  character  of  the 
summand  (and  integrand)  will  not  cause  numerical  troubles,  because 
typically  their  magnitudes  fall  off  rapidly. 

This  formalism  obviously  allows  one  to  perform  calculations 
on  a  large  number  of  thin-film  like  systems.  By  increasing  the  num¬ 
ber  of  layers  of  a  material,  and  using  an  appropriate  basis,  one 
can  monitor  the  emergence  of  surface  states  in  a  self-consistent 
calculation.  Surface  contraction  and  rearrangement  can  be  studied 
since  the  energy  can  be  reliably  optimized.  One  or  more  overlayers, 
with  different  coverages,  can  be  brought  onto  the  surfaces.  As  we 
will  obtain  Hartree  or  HF  2D  band  energies  and  charge  densities 
many  surface-related  properties  can  be  computed.  By  considering 
low  coverages  adsorption  can  be  studied.  All  types  of  orbitals,  in¬ 
cluding  3d  functions,  can  be  used.  As  found  earlier,  it  will  be  ad¬ 
vantageous  to  use  direct  lattice  sums  for  core-core  matrix  elements^. 

When  comparing  with  other  methods  one  will  recognize  chat  our 

method  will  not  be  plagued  by  finite-cluster  problems^,  matching 

,  6  7 

planes  f  or  pseudo-potential  choices  in  the  surface  region.  Possible 

disadvantages  are  associated  with  the  use  of  thin  films  to  mimic  the 

behavior  of  surfaces  on  half- inf inite  samples.  The  identification 

of  surface  states  (or  bands)  from  2D  band  structure  calculations  wil.1 

be  non-trivial.  However,  it  is  reassuring  that  typical  "healing" 

is  only  over  a  few  atomic  layers^,b.  This  suggests  that  it  is  pnysically 

realistic  to  use  thin  films  with  only  a  few  layers  to  calculate  surlace 

6 

properties  .  Also  it  is  not  clear  how  low  a  coverage  of  adsorbate 
we  can  numerically  handle  to  describe  s  ingle-adsorbate/ sub.; t rate 
intei-action.  More  theoretical  and  r.  merical  results  will  be  published 
e lsewhere. 


Id 
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Hartree-Fock  formalism  for  the  calculation  of  total  energies 
and  charge  densities  of  thin  films 
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A  formalism  has  been  developed  by  which  the  Hartree-Fock  self-consistent  equations  for 
thin  films  and  surfaces  are  solved  in  an  essentially  exact  way  from  first  principles.  The 
Fourier  transform  method,  which  was  developed  and  used  very  successfully  for  the 
calculation  of  bulk  properties,  has  now  been  extended  to  permit  the  study  of  the  surfaces  of 
crystals  of  a  wide-  variety  of  materials.  The  development  is  sufficiently  general  to  permit  the 
treatment  of  any  surface  of  any  crystal  geometry  together  with  relaxations  and 
reconstructions.  A  basis  set  of  Slater-type,  layer  orbitals  is  used  to  compute  a  self-consistent 
density  matrix  for  electrons  in  the  field  of  bare  nuclei  in  a  slab  geometry.  The  basis  functions 
are  handled  in  the  Fourier  representation  which  reduced  many  of  the  integrals  to  sums  of 
one-dimensional  ones  which  are  computed  in  closed  form.  The  use  of  the  Fourier  integral 
representation  also  permits  an  analytical  treatment  of  the  cancellation  of  the  Coulomb 
singularities  in  the  classical  potential. 

PACS  numbers:  73.60.  —  n,  71.10.  +  x,  73.20.  —  r 


I.  INTRODUCTION 

The  basis  and  the  starting  point  for  most  one-electron  calcu¬ 
lations  for  extended  systems,  including  tight-binding  methods 
and  pseudopotential  methods,  lies  in  the  Hartree-Fock  self- 
consistent  equations.  It  has  been  traditionally  accepted  that 
the  Hartree-Fock  equations  for  extended  systems  are  not  to 
be  solved  exactly,  “even  by  machines.”1  Although  this  is  no 
longer  the  case,2*7  these  solutions  have  not  been  carried  out 
for  systems  with  surfaces.  The  thought  that  such  a  thing  might 
be  possible  or  even  that  it  might  be  enlightening  is  seldom 
evident  in  the  literature.  Beside  the  fact  that  the  Hartree-Fock 
equations  are  difficult  to  solve,  it  is  quite  well  known  that  the 
solutions  have  certain  unphysical  properties,  such  as  the 
vanishing  of  the  density  of  states  near  the  Fermi  level.'1  Since 
we  are  often  interested  in  understanding  details  of  some 
spectroscopic  data,  these  unphysical  properties  make  the 
method  seem  unattractive.  For  correlating  and  extrapolating 
spectroscopic  data,  pseudopotential  or  tight-binding  methods 
which  do  not  suffer  from  any  of  these  difficulties  and  offer 
some  simple  insight  are  often  more  appropriate.  These 
methods  have  been  developed  extensively  and  applied  re¬ 
peatedly  to  many  different  kinds  of  interface  problems  with 
great  success. 

However,  a  number  of  things  can  be  learned  from  the 
Hartree-Fock  solution.  In  order  to  assess  the  plausibility  of 
various  surface  reconstruction  models  it  would  be  very  helpful 
to  have  a  reliable  estimate  of  the  relative  total  energy  of  closely 
related  structures.  There  are  a  few  methods  which  seem  to  be 
able  to  produce  such  estimates  from  rather  simple  force- 
constant  models.9  Neither  the  microscopic  foundation  for 
these  models  nor  the  reason  for  ‘heir  apparent  success  is 


especially  clear,  although  it  is  felt  that  much  of  the  forct- 
constant  energy  results  from  core-core  and  core-valence  in¬ 
teractions.  This  ability  to  determine  relative  total  energies  u 
vitally  important,  because  together  with  low  energy  electron 
diffraction  and  angle  resolved  photoemission  it  provides  a 
means  of  discriminating  between  surface  geometries.  It  is 
known  from  the  large  body  of  experience  with  molecules  that 
the  important  part  of  the  relative  energy  which  determines 
the  geometric  structure  (to  within  structures  related  by  small 
displacements)  is  given  by  the  Hartree-Fock  solution  10  By 
using  the  density  matrix,  one  can  partition  the  energy  into 
various  contributions  and  in  this  way  get  a  more  detailed 
understanding  of  the  mechanisms  which  determine  geometry 
At  the  same  time,  one  might  hope  to  understand  the  simpler 
methods  in  a  more  fundamental  way.  Since  the  converged 
Fock  matrix  for  a  particular  problem  contains  the  Coulomb 
and  exchange  potentials  as  a  function  of  the  translation 
quantum  number,  it  is  interesting  to  compare  this  matrix  with 
various  model  potentials.  Also,  since  the  one-electron  charge 
densities  are  quite  realistic,  one  can  find  the  difference  in 
electrostatic  potential  between  infinity  and  some  point  inside 
the  bulk,  and  this  defines  the  work  function  Finally,  the 
Hartree-Fock  solution  is  a  solution  of  a  well  defined  problem 
which  is  related  to  the  tiue  solution  of  the  manv-electron 
problem  in  quite  a  specific  way.  The  nature  of  this  relation 
is  such  as  to  make  the  Hartree-Fock  solutions  a  convenient 
starting  point  for  any  more  ambitious  project  which  would 
take  correlation  effects  into  account. 

The  use  of  Fourier  representation  is  the  key  to  the  treatment 
of  extended  systems.  There  are  at  least  three  reasons  for  using 
momentum  representation.  First,  on  account  o|  ;he  coie  '- 


J.  Vac.  Set  Tachnot,  1«S).  Sapt/Oct  1*79  0022-5355/79/09 1319-05*01.00  $  1979  American  Vacuum  Society 


319 


Harris,  Monkhorst,  and  Schwalm:  Hartree-Fock  formalism 


1319 


jtion  properties,  various  one  and  two-electron  integrals  over 
jie  Coulombic  mteractic.is  will  naturally  decompose  into 
roducts  of  simpler  integrals.  Secondly,  due  to  translation 
viiunetry  in  the  layer,  each  of  the  remaining  integrals  further 
imposes  into  a  two-dimensional  sum  over  one-dimensional 
•legrals  in  the  dimension  perpendicular  to  the  layer.  These 
;e  done  in  closed  form.  After  doing  the  integrals,  the  con- 
ergence  of  the  sum  is  quite  satisfactory.  Another  good  reason 
jr  doing  the  calculation  this  way  is  that  singularities  in  the 
•assical  Coulomb  terms  arising  from  the  Jatrice-plus-Hartree 
art  of  the  potential  may  be  treated  explicitly. 

This  paper  is  devoted  to  a  description  of  the  formalism 
jting  used  to  investigate  thin  films  composed  of  light  element 
atoms  and  molecules.  Eventually  this  method  will  be  extended 
■ ,  A  treatment  of  semiconductor  surfaces  and  interfaces. 

U,  'lUTUNE  OF  THE  METHOD 

The  method  is  a  generalization  of  the  Fourier  transform 
method  which  has  been  used  in  the  bulk  with  notable  suc¬ 
cess.2'5  We  have  used  a  basis  set  expansion.  Layer  orbitals  are 
constructed  from  Slater-type  orbitals  in  the  usual  way. 
However,  these  are  then  projected  onto  plane  waves  and  the 
entire  computation  is  carried  out  in  momentum  representa¬ 
tion.  It  should  be  noted  that  this  is  completely  different  from 
any  attempt  to  expand  the  potentially  localized  electronic 
structure  of  these  types  of  systems  in  plane  waves.  By  using 
all  of  the  infinite  numbers  of  Fourier  components  in  the  Sla- 
icr-type  basis  set,  the  convergence  properties  are  preserved 
in  the  case  of  localization. 

Complete  definition  of  the  Hamiltonian  requires  only  the 
identification  of  the  chemical  species  and  the  specification 
v?  the  geometry,  that  is,  of  the  positions  of  each  of  the  atoms 


i 


1  Definition  of  geometry  showing  the  built  lattice  vectors,  layer 
“wnbers,  and  a  non  primitive  displacement.  Note  that  alarm  on  the  left,  rear 
have  been  omitted  for  clarity  as  have  thoae  in  the  other  nonprimitive 


in  the  infinite  system.  The  fundamental  unit  of  the  basis  set 
is  a  layer  orbital  corresponding  to  a  translation  symmetry- 
index  p”  in  the  plane  of  the  layer: 

|p",n,«)  =  £  exp(ip'  •  Rm)|m,n,«>,  (1) 

m 

where  the  symbol  jm.n.a)  is  a  Slater  orbital  of  type  a  cen¬ 
tered  at  a  position  specified  by  m,n,  and  a  in  a  way  illustrated 
in  Fig.  1.  The  index  n  denotes  the  layer,  m  is  a  vector  index 
within  each  layer,  and  u  which  denotes  the  basis  function  type 
is  also  used  as  an  index  which  may  indicate  a  nonprimitive 
displacement  within  the  unit  cell.  The  sum  in  Eq.  (1)  is  thus 
over  all  lattice  positions  in  a  layer,  but  not  over  nonprimitive 
positions  and  not  over  the  different  layers. 

Since  atomic  positions  may  differ  in  the  selvedge  region  as 
compared  to  the  bulk  or  inner  layers  of  the  film,  the  position 
vector  may  be  broken  up  into  two  parts,  only  one  of  which  is 
determined  by  the  bulk  basis  vectors.  Figure  1  illustrates  the 
simple  case  with  the  selvedge  region  the  same  as  the  inner 
layers.  Actually,  any  specific  number  of  layers  may  be  set 
aside  as  having  orbitals  different  in  both  position  and  form 
from  those  of  tbebuik.  Novice  iiui  Tig.  1  has';een  simplified 
in  order  to  reduce  the  number  of  lines  and  make  it  more 
comprehendable.  There  would  be  atomic  sites  along  the 
dotted  edge  in  the  rear  as  well  as  other  nonprimitive  sites 
within  each  cell  congruent  to  the  one  pictured  on  layer  4. 

Matrix  elements  of  the  Fock  matrix  are  calculated  explicitly 
with  respect  to  this  basis  set.  We  have  chosen  to  use  Slater-type 
functions  rather  than  Caussians  in  our  calculations  because 
these  represent  the  solutions  well  in  the  regions  of  the  nuclei 
and  because  there  is  no  advantage  to  using  Caussians  in  the 
Fourier  representation. 

Evaluation  of  integrals  is  facilitated  by  using  plane-wave 
components  <k|p",  n,a)  defined  with  respect  to  normalized 
plane  waves. 

<k|  p*,n,a>  *  J* e~,i't  d^r.  (2) 

Clearly  this  is  zero  unless  the  parallel  component  k"  of  the 
transform  variable  is  congruent  by  reciprocal  layer  vector  to 
p*  It  is  this  property  which  reduces  inner  products  to  sums 
of  one  dimensional  integrals.  In  fact 

<k|p\n,a>  -  A(k*  -  p')(k| p'.n.a),  (3) 

where  the  capital  delta  symbol  is  defined  to  be  a  sum  over 
Kronecker  symbols  at  each  point  K"  on  the  reciprocal  layer 
lattice. 

A(k'-p')-£5K*jk'-P"  (4) 

K” 

The  basic  problem  is  to  solve  the  selt-consistent  equations 
which  arise  in  matrix  form  for  each  value  of  the  translation 
quantum  number  p'  These  p'  values  are  taken  to  be  quad¬ 
rature  points  which  will  be  used  to  sum  over  the  continuum 
of  filled  states.  At  each  one  of  these  points,  the  th  eigenvector 
is  expanded  in  layer  orbitals 

Ip'.u)  *  E  <N.««  (p*)  |p'.n.a>i  (5) 

*.a 
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and  the  coefficients  must  satisfy  (in  matrix  form) 

F(p’)c,.(p’)  =  £,(p')S(p')c.(p').  (6) 

The  first  set  of  questions  to  be  asked  concerns  the  way  in 
which  each  term  of  the  Fock  matrix  is  defined. 


F(p’)  *  T(p")  +  V(p")  +  C(p')  +  X(p').  (7) 

The  overlap  matrix  is  simply  given  by 

S,„.„v(p')  *  <p'.n,«|p',n'./J).  (8) 

The  kinetic  energy  T  has  the  form  (in  atomic  units) 

T,„.b .j(p')  =  /  (p*.ft.«|k>(V2)^<k|p'.n',/J>  d'k.  (9) 

The  terms  which  provide  for  the  interaction  between  an 
electron  and  the  lattice  or  between  an  electron  and  the  other 
electrons  can  be  written  most  easily  as  convolutions  involving 
the  distribution  matrices.  The  matrix  elements  of  these  are 
the  transforms  of  the  products  of  layer  orbitals,  or  equiva¬ 
lently,  convolutions  of  the  transforms  of  layer  orbitals. 

*  S  <p*,n,a|t)<t  +  k|  q'.n'.d)  d3t. 

(10) 

In  Eq.  ( 10),  the  product  in  real  space  has  become  a  convolution 
in  k  space.  As  a  result  of  this  convolution  and  of  translation 
symmetry,  a  rather  pleasant  decoupling  of  terms  results  which 
is  easily  comprehended.  The  lattice-plus-Hartree  terms  are 
of  the  form 


V(p*)  +  C(p’)  =■  /  W(k)Q(p’.p’.k)  [4(k)  -  2^k)id3fc, 

(11) 


W(k) 


I  1 

2  IT*  k2  ' 


(12) 


a(k)  »  21  z„etk  ■  (13) 

fl.lT 

where  W(k)  is  the  transform  of  the  Coulomb  kernel,  a(k)  is 
the  structure  factor  for  the  bare  nuclei  with  charges  and 
«(k)  is  a  similar  structure  factor  defined  in  terms  of  the  self- 
consistent  electroic  charge  density.  The  factor  of  2  comes 
about  due  to  double  occupancy  of  the  levels,  one  electron  per 
spin  state 

In  order  to  define  the  electronic  structure  factor  e(k)  or  any 
other  expression  depending  upon  the  solution,  the  density 
matrix  Dtp*)  must  first  be  defined.  The  matrix  elements  of 
Dtp')  are  expressed  in  terms  of  the  coefficients  appearing  in 
Eq.  (5)  as 


Dna.»  .dp*)  *  I  (p ’ V».fi  dip * )0,,(p ' ),  (1-4) 

If 

the  sunu  tseing  over  all  eigenstates  and  the  9,  *  1  if  the  ei¬ 
genstate  energy  is  below  the  Fermi  level  and  zero  otherwise, 
Then  at  k )  becomes 

f  "»ceiDfq*W.q’,— k)MV-  (15) 

t2r)i  Jmnt 

The  integral  being  over  the  two-dimensional  Brillouin  zone. 
A  similar  expresMon  holds  for  the  exchange  matnx.  except  that 
here  the  translation  quantum  numbers  are  coupled  in  a  more 
complicated  wav. 
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X  Qfq'.p".  -k)  d2q"d3k.  (16) 

The  reason  for  this  difference  between  the  Coulomb  and  ex¬ 
change  parts  is  quite  clear,  as  illustrated  in  Fig  2.  Because  the 
desired  solutions  have  translation  symmetry,  at  each  stage  in 
the  iteration  the  density  matrix  will  preserve  q"  Likewise, 
the  entire  Fock  matrix  conserves  p*  so  that,  in  analogy  with 
the  electron  gas,  the  interaction  either  preserves  or  exchanges 
the  translation  quantum  numbers  p*  and  q*.  In  the  case  of  ! 
the  classical  term  C,  the  fact  that  only  a  single  translation  | 
variable  p*  occurs  in  either  distribution  means  that  far  fewer  1 
Q  terms  need  be  computed.  _ 

A  very  important  cancellation  takes  place  in  Eq.  (11)  for 
the  classical  potential  The  integration  over  either  atk)  or  2atk) 
by  itself  would  clearly  diverge  in  view  of  the  fact  that  these 
are  the  contributions  due  to  the  separate  positive  and  negative 
charge  distributions,  which  are  infinite  in  extent.  This 
Madelung-like  cancellation  will  be  discussed  below. 

III.  EVALUATION  OF  INTEGRALS 

Consider  first  the  integrals  [Eq.  ( 10)  |  which  form  the  ele¬ 
ments  of  the  distribution  matrix  for  a  given  p",  q",  and  k.  Due 
to  the  translation  selection  rule,  the  integrand  of  one  of  these 
integrals  is  zero,  unless  p"  +  k*  —  q*  is  a  reciprocal  layer 
vector.  Here,  k*  is  the  component  of  the  Fourier  transform 
variable  k  parallel  to  the  layer.  Moreover,  even  with  this 
condition  satisfied,  the  only  contribution  to  the  integral  comes 
from  t"  close  to  p*  +  K*,  where  K*  is  some  reciprocal  layer 
vector.  There  are  peaks  ir.  the  integrand  resulting  from  the  | 
sum  over  phase  factors  within  the  layer  orbitals,  and  these  are 
centered  near  p”  +  K*  for  each  K*  and  have  a  width  of  the 
order  of  the  reciprocal  of  the  linear  size  of  the  film,  which  we 
will  finally  take  to  infinity.  Taking  this  limit  transforms  the 
distributions  into  sums  over  one-dimensional  integrals  in  the 
direction  perpendicular  to  the  film. 

Now  we  note  the  form  of  these  one-dimensional  integrals. 
Since  the  layer  orbitals  are  composed  of  Slater-type  functions 
whose  Fourier  transforms  are  simple  combinations  of  rational 
functions,  these  integrals  may  be  done  in  closed  form.  In  fact. 
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a  t\  pical  integral  is  a  sum  of  a  small  number  (two  or  three)  of 
functions  of  the  type 


/rWa.M.M) 


x: 


t  ee'ut  dt 


(t-  +  a-)ml(f  -  k )2  +  b-]"  ' 


(17) 


where  the  integration  variable  t  is  the  z  component  of  t  and 
the  k  parameter  is  minus  the  z  component  of  k.  The  param¬ 
eter  a  is  the  square  root  of  the  sum  of  the  square  of  the  Slater 
exponent  of  a  and  the  square  of  the  length  of  p*  +  t",  b  is  a 
similar  expression  with  «  replaced  by  3  and  p"  +  t"  replaced 
by  q*  +  t"  +  k*.  The  p  of  this  expression  is  the  perpendicular 
distance  between  the  two  layers  n  and  n'  which  appear  in  the 
definition  of  the  Q  matrix  elements  [Eq.  (10)).  The  ]’ s  may 
be  integrated  by  residues  in  the  standard  way  and  a  result  for 
the  general  case  is  obtained. 

The  one-dimensional  integrals  may  be  summed  over  t"  to 
produce  Q,  as  is  convenient  when  p*  =  q",  or  sometimes  it  will 
be  wise  not  to  do  this  summation.  The  reason  why  one  may 
not  wish  to  sum  here  is  that  the  integral  over  k  in  Eq.  ( 16)  may 
also  be  done  in  closed  form.  This  is  due  to  the  fact  that  the 
values  of  the  integrals  over  t  resulting  from  the  residues  of  Eq. 
(17)  are  in  fact  rational  functions  of  k.  When  two  of  them  are 
multiplied  with  W(k),  the  result  is  a  rational  function  with 
only  a  few  poles  which  are  due  to  explicit  factors  of  the  de¬ 
nominator. 

In  the  case  when  the  summation  to  form  Q  is  done  first,  it 
must  be  performed  numerically.  Although  these  sums  con¬ 
verge  quite  well,  the  k  integrals  must  then  also  be  done  nu¬ 
merically. 

Another  difference  between  the  Coulomb  and  exchange 
terms  becomes  evident  with  consideration  of  the  integrations 
over  the  two-dimensional  Brillouin  zone  of  the  translation 
quantum  number  q Integrals  of  this  type  appear  in  Eqs.  ( 15) 
and  (16).  It  is  clear  that  these  integrations  must  be  done  nu¬ 
merically,  since  the  value  of  the  integrand  at  each  point  results 
from  the  solution  of  the  secular  Eq.  (6).  The  fact  that  all  of  the 
functions  appearing  in  these  integrands  must  be  periodic  with 
respect  to  q"  provides  a  natural  technique  for  this  numerical 
integration.  Each  integrand  may  be  expanded  in  a  Fourier 
series.  The  coefficient  of  the  constant  term  is  proportional  to 
ihe  value  of  the  function  over  the  Brillouin  zone.  The  coeffi¬ 
cients  of  any  fixed  number  of  terms  in  the  expansion  may  be 
found  by  summation  over  a  grid  of  quadrature  points  spaced 
equally  over  the  zone.  The  quadrature  is  thus  exact  up  to  a 
fixed  number  of  Fourier  coefficients.  This  method  is  closely 
related  to  Gauss-Tchebvchev  quadrature  extended  to  two 
dimensions.  This  method  may  be  applied  to  both  the  integrals 
of  the  tvpe  appearing  in  e(k)  and  the  type  appearing  in 
»p'). 

There  is  an  extra  simplification  which  reduces  the  two- 
dimensional  integrals  of  the  Coulomb  type  still  further.  It  has 
been  pointed  out  by  Monkhorst  and  Pack  that  in  the  special 
but  very  important  case  when  the  integrand  transforms  ac¬ 
cording  to  the  identity  representation  of  the  point  group  of 
the  surface,  then  the  integration  may  be  performed  using  only 
the  small  subset  of  the  quadrature  points  which  lie  within  the 
irreducible  wedge  of  the  zone. 11  Integrals  of  the  V(p')  + 
C(p')  terms  are  of  this  form,  and  this  can  he  demonstrated  as 


follows: 

Consider  the  integrand  of  Eq.  (.15).  It  is  not  obvious  that  the 
integration  variable  q’  mav  be  rotated  through  a  s\  mmetrv 
operation  g  in  the  plane  of  the  layer  without  affecting  the 
value  of  the  integral.  First  of  all,  the  matrix  Q  will  undergo 
a  similarity  transformation  with  respect  to  a  representation 
matrix  of  g.  However,  D  will  transform  the  same  way,  and 
the  rotation  matrices  will  multiply  to  unity  because  of  the 
trace  operation.  This  will  produce  an  expression  exactly  like 
Eq.  (15),  except  that  k  will  be  transformed  by  g.  The  integral 
is  thus  invariant  with  respect  to  symmetry  operations  on  k. 
This  shows  that  in  the  integrand  of  the  integral  of  Eq  ill) 
e(k)  is  symmetric  in  k.  Thus  it  may  be  replaced  by  the  aver¬ 
age  of  itself  over  all  k  related  by  operations  of  the  group. 
When  this  is  done,  the  integrand  in  Eq.  (15)  now  becomes 
symmetric  in  q*  as  required. 

It  is  unfortunate  that  the  p*  in  the  integrand  of  Eq.  ( 16) 
makes  a  similar  simplification  impossible  for  the  exchange 
terms.  These  terms  must  be  treated  separately. 

The  final  set  of  integrals  to  be  considered  are  those  over 
the  three  components  of  k  in  Eqs.  (11)  and  ( 16).  The  selection 
rule  (Eq.  (3)|  for  (k|p'’,n.«>  is  written  in  terms  of  Kroneekers 
delta  symbols.  Under  certain  conditions,  these  deltas  also  have 
Dirac  delta  function  properties  with  respect  to  integration, 
but  one  must  use  caution  since  there  is  an  interesting  size  ef¬ 
fect  which  turns  out  to  have  a  physical  meaning.  As  noted 
above,  these  delta  functions  have  a  width  when  considered 
as  a  function  of  k"  which  depends  inversely  upon  the  linear 
size  of  the  layer. 

The  details  of  the  reduction  of  the  Coulomb-plus-lattice 
matrix  elements  to  the  form  presented  in  (11)  have  been 
treated  quite  carefully. i12  The  delta  functions  in  Eq  (3)  come 
about  because  of  a  sum  over  phase  factors  embedded  in  the 
definition  of  the  layer  orbitals  in  Eq.  (1).  If  these  deltas  can 
be  thought  of  as  being  proportional  to  Dirac  deltas,  then  the 
integral  over  the  sum  will  become  a  sum  of  integrals.  However 
the  sums,  which  were  originally  sums  of  phase  factors  do  not 
converge  uniformly  for  an  infinite  layer  near  the  point  where 
k  =  0.  Near  this  point,  the  terms  in  the  sum  are  all  going  to 
one.  This  lack  of  uniform  convergence  near  k  =  0  prevents 
the  exchange  of  summation  and  integration,  and  thus  some 
small  ball  around  this  point  must  be  removed  from  the  region 
of  integration  and  treated  separately.  The  contribution  from 
this  ball  must  be  treated  in  the  limit  that  the  size  of  the  ball 
goes  to  zero. 

The  result  of  the  analysis  outlined  above  is  that  when  the 
symmetry  is  such  that  the  self-consistent  solution  does  not 
produce  a  net  dipole  moment  pier  unit  volume  with  a  com¬ 
ponent  in  the  plane  of  the  layer,  then  Eq.  ( 1 1 )  reduces  to  a  sum 
of  one-dimensional  integrals  with  only  minor  qualifications.' 
The  integrand  of  the  one-dimensional  integral  with  k"  =  0 
is  singular  at  zero.  The  Cauchy  principal  value  is  to  be  un¬ 
derstood  in  this  case. 

The  expression  for  the  total  energy  of  the  film  is  not  simply 
the  sum  over  filled  states  of  the  band  energies,  as  this  would 
count  the  electron-electron  interaction  energy  twice,  but  is 
given  by 

£  =  ^E  J9l( P’)E.(p'*V 
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+  ^<T>  +-<V>  +  fl,  (18) 
2  2 

where  a  is  the  area  of  the  unit  cell  in  the  layer  and  the  term 
denoted  by  R  is  the  lattice-lattice  interaction.  A  similar  careful 
analysis  of  these  terms  shows  that  except  for  the  T  term,  each 
one  leads  to  sums  of  integrals  plus  residua)  parts  due  to  the 
exclusion  of  the  origin  in  k.  The  singularities  cancel  in  such 
a  way  that  there  may  be  a  contribution  to  the  energy  due  to 
the  explicit  treatment  of  this  excluded  part.  This  contribution 
is  the  multipole  expansion  of  the  classical  electrostatic  po¬ 
tential  due  to  the  edge  of  the  film.  In  the  case  when  there  is 
no  dipole  moment  per  unit  volume  in  the  plane  of  the  layer, 
these  terms  do  not  contribute  in  the  limit  of  an  infinite 
layer. 

The  details  of  the  reduction  of  the  total  energy  expression 
will  be  presented  elsewhere.8,12  There  are  terms  arising  from 
the  electrostatic  energy  of  the  lattice  of  positive  nuclei  bal¬ 
anced  by  a  uniform  negative  background  charge.  These 
Madelung-like  terms  do  not  depend  upon  the  density  matrix, 
and  so  they  may  be  computed  once  and  for  ail  for  a  given 
crystal  geometry.  Then,  there  are  other  terms  which  describe 
the  change  in  the  potential  energy  due  to  the  nonzero  Fourier 
components  of  the  electronic  charge  density.  These  do  depend 
upon  the  solution  through  the  density  matrix. 

IV.  SUMMARY 

The  formalism  presented  above  is  flexible  enough  to  permit 
the  investigation  of  a  large  number  of  interesting  systems.  The 


lattice  size  may  be  expanded  in  order  to  investigate  the  iso¬ 
lated  molecular  limit,  or  the  selvedge  region  may  be  recon¬ 
structed  almost  arbitrarily.  Calculations  are  in  progress  on 
hydrogen  and  will  proceed  to  other  light  elements.  The  next 
major  formal  development  to  be  undertaken  is  the  intro¬ 
duction  of  an  ab  initio  pseudopotential  in  order  to  treat  the 
case  of  atoms  with  cores  in  a  way  which  avoids  computing  a 
large  number  of  integrals. 
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It  has  been  shown  that  the  divergence  or  pathologically  slow  convergence  of  perturbation  expansions 
involving  a  weak  or  none  symmetry  forcing  (Mumll-Show-Musher-Amos,  symmetrized  polaruation  or 
polarization  expansions)  can  be  circumvented  by  using  a  simple  analytic  continuation  procedure.  When 
applied  to  the  interaction  of  a  hydrogen  atom  wuh  a  proton  this  procedure  provides  accurate  values  of  the 
exchange  energy  from  the  knowledge  of  the  polarization  senes  alone.  When  the  ungerade  symmetry  is  forced 
the  symmetrized  polanzation  series  is  shown  to  converge  to  a  spunous.  unphysical  interaction  energy.  The 
true  interaction  energy  can  only  be  recovered  by  the  analytic  continuation  procedure.  This  procedure  provides 
us  also  with  the  information  about  location  of  singularities  of  the  analytic  functions  defined  by  the 
perturbation  series.  Such  information  turns  out  to  be  sufficient  for  an  understanding  of  peculiar  convergence 
properties  of  the  perturbation  expansions  and  their  Pade  approximants. 


I.  INTRODUCTION 

It  is  well  known  that  there  exist  very  many  dilferent 
perturbation  expansions  (or  molecular  interaction  en¬ 
ergies,  usually  referred  to  as  exchange  perturbation 
theories. 1  It  ts  less  known,  however,  that  in  practice 
the  majority  of  them  cannot  be  applied  to  study  interac¬ 
tions  of  many-electron  systems.  This  majority  includes 
all  theories  which  require  solving  perturbation  equations 
involving  the  full  N-electron  antisymmetrizer  .  The 
presence  of  this  operator  prohibits  any  separation  of 
variables  in  perturbation  equations  and  even  for  sep¬ 
arable  Ha  the  first-order  wave  function  would  have  to 
contain  all  many-electron  clusters  up  to  an  iV-elec- 
tron  cluster.  2  Very  recently  Adams3  showed  that  the 
Eisenschitz-London-Hirschfelder-van  der  Avoird 
(EL-HAV)  theory  can  be  modified  in  such  a  way  that  only 
a  partial  antisymmetrization  including  only  single  inter¬ 
changes  appears  in  perturbation  equations.  This  is  an 
essential  simplification  since  the  first-order  wave  func¬ 
tion  then  involves  only  up  to  four-electron  clusters.1 
Nevertheless  such  a  method  would  be  less  practical  than 
the  methods  in  which  „-f  does  not  appear  in  perturbation 
equations  at  all,  i.  e. ,  in  which  the  nth-order  wave  func¬ 
tion  can  be  expressed  in  terms  of  at  most  ( n  +  l)-electron 
clusters.3  It  turns  out  that  this  condition  can  be  satisfied 
only  by  perturbation  formalisms  involving  a  weak  or  no 
symmetry  forcing.  These  are  the  standard  Rayleigh- 
SchrSdinger  (RS)  perturbation  expansion,  referred  to 
as  the  polarization  expansion  after  Hirschfelder,  the 
symmetrized  Rayleigh-SchrOdinger  (SRS)  expansion, 
usually  referred  to  as  the  symmetrized  polarization 
expansion,  and  the  Murrell -Show-Musher -Amos  (MSMA) 
expansion.  The  symmetry  forcing  peculiar  to  these  ex¬ 
pansions  is  characterized  in  more  detail  in  Ref.  S.  The 
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fact  that  the  interaction  energy  in  the  MSMA  theory  can 
be  calculated  without  solving  perturbation  equations  in¬ 
volving  a  projection  operator  is  a  direct  consequence  of 
the  interchange  theorem  proved  in  Ref.  4. 

The  results  of  the  numerical  calculations  for  the  inter¬ 
action  of  a  hydrogen  atom  and  a  proton,  reported  in  Refs. 
5  and  6,  show  that  the  convergence  properties  of  the 
three  expansions  mentioned  above  are  far  from  de¬ 
sirable.  The  convergence  of  the  polarization  expansion 
is  pathologically  slow  (especially  at  large  distances  R), 
and,  moreover,  the  senes  can  provide  us  only  with  the 
energy  of  a  physically  uninteresting  fully  symmetric 
state.  The  symmetrized  polarization  expansion  also 
converges  pathologically  slowly,  and.  when  the  ungerade 
symmetry  is  forced  the  series  does  not  seem  to  converge 
to  any  eigenvalue  of  the  Hamiltonian.  The  MSMA  series, 
on  the  other  hand,  clearly  diverges  to  infinity  in  this 
case.  To  find  a  way  out  of  this  highly  unsatisfactory 
situation  the  usual  (rational)  Pade  approximants  have 
been  calculated  for  these  expansions. 7  It  turned  out  that 
the  Pade  approximants  for  the  MSMA  series  converged 
very  quickly  to  the  exact  energy,  thereby  providing  ef¬ 
fective  analytic  continuation  of  the  series  beyond  its 
convergence  radius.  On  the  other  hand,  the  Pade  ap¬ 
proximation  was  a  complete  failure  in  the  case  of  the  RS 
and  SRS  series. 

In  the  present  paper  we  want  to  show  that  these  two 
series  can  be  very  effectively  summed  using  a  different 
kind  of  analytic  continuation,  based  upon  employing  a 
simple  algebraic  function  having  only  square  root  type 
singularities.  This  procedure,  being  a  particular  case 
of  the  so-called  quadratic  Pade  approximation  introduced 
by  Shafer, '  also  works  very  well  in  the  case  of  the  MSMA 
method  and  can  be  used  to  locate  singularities  of  all  three 
perturbation  series.  The  knowledge  of  the  positions  and 
the  character  of  these  singularities  turns  out  to  be  suf¬ 
ficient  to  explain  the  peculiar  convergence  properties  of 
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the  RS.  SRS,  and  MSMA  expansions  as  well  as  of  the 
corresponding  Pade  approximants. 

The  plan  of  this  paper  is  as  follows.  In  Sec.  II  we 
formulate  the  problem  in  a  rigorous  mathematical  way 
by  defining  interpolation  functions,  i.e.,  analytic  func¬ 
tions  which  after  expanding  them  in  a  power  series  around 
X  =  0  produce  the  perturbation  series  considered  by  us. 

In  Sec.  Ill  these  interpolation  functions  and  their  singu¬ 
larities  are  investigated  for  an  exactly  soluble  model. 

The  information  obtained  from  this  model  is  used  in  Sec. 

IV  to  set  up  an  analytic  continuation  procedure,  which 

is  subsequently  tested  for  the  realistic  system  consisting 
of  a  hydrogen  atom  interacting  with  a  proton.  Section 

V  contains  a  discussion  and  conclusions. 

II.  INTERPOLATION  FUNCTIONS 

When  the  interaction  between  two  subsystems,  denoted 
below  by  A  and  B,  is  completely  neglected,  the  Hamil¬ 
tonian  H0 .  the  wave  function  tl0  ,  and  the  energy  e„  of  the 
system  AB  are  simply  expressed  via  the  Hamiltonians, 
the  wave  functions,  and  the  energies  of  isolated  subsys¬ 
tems:  Ra  =Hq  +H$,  do  =<?o  <Po  >  and€0  =  fo*<f-  To  find 
the  interaction  energy  «  we  have  to  add  the  interaction 
operator  V  to  H0,  accounting  for  the  Coulombic  interac¬ 
tions  between  elementary  particles  of  A  and  B,  and  sub¬ 
sequently,  to  find  the  appropriate  eigenvalue  of  H=Ha 
*  V.  To  solve  this  problem  by  the  RS  perturbation  theory 
we  introduce  a  complex  variable  x,  the  perturbed  Hamil¬ 
tonian  H{\)  =//0  *XV,  and  the  function  fx)  defined  by  the 
eigenvalue  equation 

//(X)<lRS(X)=[€0+€BS(X)]iRS(X)  .  (1) 

The  function  £*s(x)  will  be  referred  to  as  the  RS  inter¬ 
polation  function.  We  know  from  the  Kato  theory’  that 
«"•(*)  exists  and  is  analytic  in  some  neighborhood  of 
X  =0.  The  RS  expansion  is  then  defined  as  the  power 
series  expansion  of  ^(x): 


t-Ec.1T',  (4) 

t>i 

where  R  is  a  measure  of  the  distance  between  A  and  B. 
This  means  that  each  term  in  (4)  can  be  obtained  from 
(3)  in  finite  order  treatment. 14  Particularly,  if  A  and 
B  are  neutral  systems  we  have 

IW 

£-H  =0{R-3m‘i)  ,  (5) 

"•1 

i.e.,  the  mth  order  treatment  gives  all  C,  constants 
from  k  =  1  through  k  =3 m  +2.  It  should  be  stressed,  how¬ 
ever,  that  practical  usefulness  of  the  asymptotic  ex¬ 
pansion  (4)  is  very  limited  since  the  exponentially  de¬ 
creasing  components  of  the  interaction  energy  are  ob¬ 
viously  neglected  by  this  expansion.  Moreover,  al¬ 
though  this  fact  has  not  been  rigorously  proven,  it  is 
generally  assumed  that  the  expansion  (4)  is  divergent 
for  each  value  of  R.14,IS 

The  exchange  perturbation  theories  designed  to  obviate 
the  difficulties  mentioned  above  provide  us  with  many 
other  interpolation  functions  «(x),  having  different  singu¬ 
larities  and  leading  to  different  power  series  expansions: 

c(X)=£  e.V  .  (6) 

*■1 

As  a  matter  of  fact  these  interpolation  functions  are  con¬ 
strained  only  by  two  conditions: 

(1)  €  (0)  =  0  , 

(ii)  €(1)  =  <  , 

and  can  be  defined  in  many  possible  ways  (see  Refs.  1, 

4,  and  16  for  a  review  of  recent  literature).  For 
practical  reasons  it  is  also  sensible  to  require  that 

(iii)  lim  R*(e„  -  «JS)  =0  for  each  fe>0 
x-- 

and 


€"(X)=L  £?V,  (2) 

n*  l 

and  the  interaction  energy  <  can  be  calculated  from 
«  =  eM(l)*Z^.  (3) 

w»l 

Obviously,  in  practice  the  perturbation  corrections  ej3 
are  calculated  directly  from  perturbation  equations'0 
without  the  knowledge  of  c^fx).  The  importance  of  e^tx) 
lies  in  the  fact  that  its  singularities  determine  the  con¬ 
vergence  radius  of  (2),  the  rate  of  convergence  of  the 
series  (3),  and  the  convergence  properties  of  the  Pade 
approximants  to  the  power  series  (2).  The  expansion 
(3)  would  be  particularly  useful  if  the  convergence  radius 
of  (2)  were  greater  than  unity,  i.e.,  if  the  singularities 
of  <“(x)  lay  far  outside  the  unit  circle.  Unfortunately, 
it  is  known11,  u  that  the  polarization  expansion  cannot 
satisfy  this  condition.  Moreover,  as  shown  by  Claverie, 11 
in  the  case  of  more  than  two-electron  systems,  the  series 
(3)  can  converge  only  to  the  energy  of  states  which  are  in¬ 
compatible  with  the  Pauli  exclusion  principle.  It  is  in¬ 
teresting  that  in  spite  of  the  above  drawbacks  the  series 
(3)  generates  the  correct  large  R  asymptotic  expansion 
for  «, 


(iv)  for  separable  tf0,  c,  can  be  calculated  in  practice 
for  many -electron  systems.  Condition  (iii)  guarantees 
that  the  exchange  contributions  «„  vanish  exponentially 
with  R,  i.e.,  that  the  perturbation  expansion  for  c 
does  not  produce  any  long-range  exchange  terms.4  It  is 
tempting  to  also  require  that 

(v)  singular  points  of  «(x)  lie  far  outside  the  unit 
circle. 


Unfortunately  this  condition  seems  to  be  incompatible 
with  (iv)  and  none  of  the  proposed  perturbation  formalisms 
satisfy  both  (iv)  and  (v).  As  a  matter  of  fact,  it  is  the 
purpose  of  this  article  to  show  that  condition  (v)  may  be 
relaxed  in  practice. 


A  convenient  way  to  construct  interpolation  functions 
is  to  use  the  idea  of  symmetry  forcing4,4  and  to  define 
c(X)  by  the  system  of  equations 


£  (X )  =  X 


(<iol  VFl  j (X)) 
(tip  IF*itl(X)) 


(7) 


d  (X)  =  tip  +  Rg  (t  (X )  —  X  I  )  Fj  if  (X)  , 


(8) 


where  R0  is  the  reduced  resolvent  of  H0 ,  and  F,  and  F, 
are  symmetry  forcing  operators  which  can  be  equal  to 
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the  identity  operator,  the  idempotent  antisymmetrizer 
A.  or  some  other  more  complicated  operator. 4  It  can 
be  shown  that  for  =1,  Eqs.  (7)  and  (8)  are  equivalent 
to  the  Schriidinger  equation  (1)  constrained  to  the  sym¬ 
metry  subspace  determined  by  Ei  and  E2.  Therefore, 
til)  -<• 

If  Fx  =FZ  =  1  (no  symmetry  forcing)  Eqs.  (7)  and  (81 
are  equivalent  to  (1)  for  each  x  and  in  this  case  «(x) 
-^(X).  The  choice  Ei  =-^f,  E2  =  l,  classified  as  a  weak 
symmetry  forcing,  leads  to  the  perturbation  expansion 
of  the  MSMA  theory.4  The  interpolation  function  t(X) 

=  <*tSMA(X)  is  then  defined  by  the  equation 

'c0i«*4SUA(x)-xkMi  -/?0(<““A(x)-xr>)-%'=0  ,  (9) 

resulting  after  eliminating  3(x)  from  Eqs.  (7)  and  (8). 
Note,  that  if  we  skipped^!  in  (9)  this  equation  would  de¬ 
fine  the  RS  interpolation  function  ^(x).  We  see  that 
<**(>)  and  ^“‘(x)  are  implicitly  defined  multivalued 
functions  of  the  complex  variable  X.  Their  values  lie 
on  many  branches  which  may  be  obtained  by  analytic 
continuation  of  the  series  (2)  or  (6). 

The  choice  E2*l  gives  a  stronger  symmetry  forcing 
employed,  for  example,  in  the  EL-HAV  (E.  =  E2  =<si)  or 
the  Hirschf elder -SUbey  (Ei  and  E2  are  given  in  Ref.  4) 
theories.  Although  the  strong  symmetry  forcing  seems 
to  guarantee  very  fast  convergence  of  the  perturbation 
expansion.  * 4  it  is  incompatible  with  condition  (iv)  and 
cannot  be  applied  to  studying  physically  interesting  sys¬ 
tems.  Therefore,  strong  symmetry  forcing  expansions 
will  not  oe  discussed  in  this  paper. 


(2)  What  are  the  singular  points  of  the  chosen  branches 

oi  e(X). 

The  answer  to  the  first  question  determines  the  pos¬ 
sible  sums  of  perturbation  expansions.  The  answer  to 
the  second  question  determines  the  convergence  prop¬ 
erties  of  the  series.  It  seems  to  be  impossible  now  to 
calculate  interpolation  functions  exactly,  even  for  very 
simple  systems  like  HI  .  Therefore,  we  shall  try  to 
answer  the  above  questions  studying  an  exactly  soluble 
model  possessing  all  essential  features  of  the  H|  sys¬ 
tem. 

111.  AN  EXACTLY  SOLUBLE  MODEL 

In  1972  Schulman11  considered  a  two-state  model  for 
which  he  was  able  to  prove  the  convergence  of  the  po¬ 
larization  expansion  at  small  values  of  the  internuclear 
distance  R.  This  model  consists  in  the  perturbative  ex¬ 
pansion  of  the  H2  secular  equation  in  the  minimal  {lsA , 
lss}  basis  set,  the  Hamiltonian  matrix  being  partitioned 
as  Ho  +xV,  where  Ho  and  V  are  the  matrices  of  the  opera¬ 
tors  H0  and  V  defined  at  the  beginning  of  Sec.  n.  From 
our  point  of  view  this  model  has  two  essential  draw¬ 
backs.  Firstly,  as  plainly  stated  by  Schulman.  it  is  in¬ 
adequate  at  large  R.  and  secondly,  it  is  trivially  solved 
by  any  symmetry  forcing  perturbation  procedure.  To 
obtain  a  nontrivial  model,  valid  at  large  R,  we  extended 
the  minimal  basis  by  the  polarization  functions  2 pA 
=  ctr-Rx)  and  Zpa  =  -o(r  -RB),  where 

.  (12) 


The  interpolation  function  of  the  symmetrized  polariza¬ 
tion  expansion5  is  defined  explicitly  as 


(do  >’  t'.-W/w(X)> 


•  ,..rt 


where  ;M(.\)  is  defined  by  Eq.  (1)  or  by  Eqs.  (7)  and  (8) 
with  E-,  =  E;  =  1 .  It  may  be  argued  that  the  3RS  method 
cannot  be  useful  for  physically  interesting  states  for 
with ,4 .“(1)  =0  and  e^x)  has  a  singularity  at  the  physi¬ 
cal  value  of  the  expansion  parameter. 5,18  In  fact,  this 
singularity  is  removable,  at  least  for  sufficiently  large 
R.  since,  as  can  easily  be  shown  by  expressing  B/3X 
’  l-  ^(X)!^!  in  terms  of  eigenfunctions  of  W(  1), 


UmR*‘  r“(woi-'4<-M(X))|;i.i=®  for  each  *>0.  (11) 

3-  * 


it  must  be  stressed,  however,  that  ^“(l)  defined  as  a 
limit  of  ^“(x)  for  1  cannot  be  shown  to  be  an  eigen¬ 
value  of  Hi 1)  =  H,  -V.  Thus,  even  if  the  power  series  for 
^"(x)  converges  at  x  =1,  its  sum  need  not  have  a  physical 
meaning  (although  it  must  have  the  same  large  R  asymp- 
'oiic  expansion  as  «).  However,  as  can  be  seen  from 
Eq.  U0>,  ^“(A)  has  a  second  branch  arising  from  that 
branch  of  ,RS(x)  which  satisfies  cM(l)  =^fiR*fl).  This 
branch  of  ^“'x)  is  reguiar  at  x  =  1,  and  <***(!)  =  <  on  it. 


We  have  assumed  here  that  the  vectors  Rx  and  R3, 
specifying  the  nuclear  positions,  lie  on  the  z  axis.  It 
will  be  shown  later  that  due  to  this  particular  choice  of 
u>(r)  the  model  gives  asymptotically  100%  of  the  total 
energy  and  98%  of  the  exchange  energy  for  the  sys¬ 
tem.  At  small  R  this  model  is  also  quite  adequate. 

At  R  =3  bohr  the  energy  of  the  2pu„  state  is  given  with 
only  0. 3%  error. 

The  exact  Schrodinger  equation  for  W2  is  now  replaced 
by  the  four -dimensional  secular  equation 

(Ho  *V)c  =  (e0  -*•<)  Sc  ,  (13) 

where  the  matrices  Hg  ,  V,  and  S  are  now  calculated  with 
{lsA,  Is,,  2  pA,  ZpB }  as  the  basis  set.  Incomplete 
analogy  with  the  infinite -dimensional  case,  this  equation 
can  be  solved  using  the  Rayleigh -SchrSdinger  or  sym¬ 
metry  forcing  perturbation  techniques.  In  the  latter  case 
the  symmetry  projector  is  defined  as5 


Two  questions  concerning  the  interpolation  functions 
now  appear  to  be  relevant: 

(1)  How  are  the  eigenvalues  of  H{  1)  connected  with 
those  of  W  O)  through  the  complex  curves  "tu),  where 
^  distinguishes  between  different  branches  of  <(x). 


Assuming  A  in  the  above  form  we  force  the  ungerade 
symmetry  of  c,  i.e.,  we  consider  the  2pcr„  state  of  HJ. 

The  interpolation  function  yielding  the  polarization 
expansion  for  t  is  an  implicit  function  defined  by  Eft”, 
x)=0,  where 
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FU',  \)=det(Ho  -«0S--XV  -u>S)  .  (15) 

The  MSMA  expansion  for  1  is  obtained  by  expanding  the 
interpolation  function  7*““a(x),  defined  implicitly  by  the 
system  of  equations 


-  iaUA 


=  X 


c;VAc 
c;sac  ’ 


(16) 


c=c0.R0(i“s“*S-\V)c  ,  (17) 

where  c0  is  the  unperturbed  vector  satisfying  (Ho  -  <oS)c0 
=  0,  i.e. ,  ej-d,  0,  0,  0),  and  R0  is  a  matrix  counterpart 
of  the  reduced  resolvent,  normalized  so  that  c;SR0=0,8 

Ro  =  (1  ™  CqC0S)(Hq  —  £qS  ±  CqCq)  1  .  (18) 

Eliminating  c  from  (17)  we  find  that  7“s“A(x)  is  an  im¬ 
plicit  function  defined  by  Cfe1*3**,  a)  =0,  where 

G(w,  A)=c;(wS-xV)A[l -R0(u)S-xV)l-lca  .  (19) 

The  interpolation  function  7SRS(x)  of  the  symmetrized 
polarization  function  is  defined  explicitly  as 


TABLE  1.  Branch  points  determining  the  convergence  proper¬ 
ties  of  the  polarization,  symmetrized  polarization,  and  MSMA 
expansions  in  tne  case  ot  the  exactly  soluble  tour-state  model 
of  Ht.  The  MSMA  branch  points  correspond  to  the  ungerade 
symmetry  forcing. 


R 

XRS 

At> 

XMiMA 

3 

1.  05157834  ±  0.  27924566* 

0.  70574669  ±  0.  02556853 

4 

1. 01538451  ±0.  12900230: 

0.  85643594  ±0.  02067580 

6 

1.00096729  ±0.02537625: 

0.  97345544  ±  0 .  00228434 

8 

1.00004189±0. 00447076: 

0.9954a067  ±0. 00017819 

10 

1 . 00000 147  ±  0 . 00074047: 

0.99925786  i  0.0000 1358 

12.5 

1 . 00000002  ±  0.  00007446: 

0.  99992552  ±  0.  00000062 

for  all  four  values  of  these  functions.  On  the  other 
hand,  among  the  four  values  of  7SBS  (1)  only  two  are  com¬ 
mon  with  7  “(1).  This  is  due  to  the  removable  singularity 
appearing  at  a  =  1  on  those  Riemann  sheets  of  7srs(a)  for 
which  cKS(l)  has  gerade  symmetry.  On  these  Riemann 
sheets  7SBS(1)  does  not  satisfy  Eq.  (13). 


es“(A)=x 


c;VAcm(x) 

c;sac*s(a) 


(20) 


where  c^tx)  is  a  solution  of  Eq.  (13)  in  which  V  has  been 
replaced  by  xV. 


Since  F(u,  X)  and  the  numerator  of  G(ui,  X)  are  fourth- 
order  polynomials  in  ui  with  X  independent  coefficients  at 
w\  the  functions  7“ (A)  and  7“3I4A(A)  are  four-valued 
algebraic  functions  having  only  branch  point  singularities. 
The  SR S  interpolation  function  7srs(a)  must  have  the 
same  branch  point  singularities  as  7ra(X)  plus  some  po¬ 
lar  singularities  due  to  the  presence  of  the  denominator 
in  Eq.  (20).  After  some  algebraic  manipulations  with 
the  rhs  of  Eq.  (19),  it  is  possible  to  show  that  F( ui,  1) 
has  the  same  roots  as  C(w,  1),  i.e.,  7“(1) -7“SMA(1) 


FIG.  1.  Schematic  plot  of  the  two  lowest  branches  of  c ra(X) . 

and  calculated  at  R  =  3.0  bohr  within  the  four- 

state  model  of  Sec.  III.  The  ungerade  symmetry  w-.s  forced 
in  defining  t3ra{\)  and  c“3lAtX).  The  separation  between  two 
branches  of  <sfls(X)  is  highly  exaggerated.  <f  and  <„  denote  the 
interaction  energies  in  the  gerade  and  ungerade  states,  re¬ 
spectively,  and  is  the  "apparent  interaction  energy.  " 


Since  the  matrices  H0 ,  V,  and  S  are  known  functions 
of  R  and  the  quartic  equations  can  be  solved  via  roots, 
the  interpolation  functions  can  in  principle  be  expressed 
analytically  in  closed  form.  However,  the  resulting 
formulas  would  be  enormously  complicated  and  in  practice 
it  is  much  easier  to  calculate  the  roots  of  F(u>,  X)  and 
G(ui,  X)  numerically  for  interesting  values  of  x  and  R. 
Figure  1  shows  the  RS,  SRS,  and  MSMA  interpolation 
functions  obtained  in  this  way  for  R  =3.0  bohr.  We 
present  only  the  two  lowest  branches  of  these  functions 
since  only  these  branches  of  the  exact  interpolation 
functions  can  be  reasonably  well  approximated  by  our 
model. 


Finding  branch  points  of  7 “(A)  and  7*““*  (A)  is  more 
difficult  since  eliminating  u>  from  the  system  of  equa¬ 
tions 


F(u,  X)  =  0  , 


3F 

du; 


(w,  X)  =0  , 


(21) 


and  similarly  for  G(wi,  A),  leads  to  a  twelfth-degree 
resultant  polynomial, 18  the  roots  of  which  are  numerical¬ 
ly  unstable  at  large  R  and,  obviously,  cannot  be  ex¬ 
pressed  in  closed  form.  Nevertheless,  using  the  double 
precision  arithmetic  of  the  DEC  20/40  computer  we 
were  able  to  find  these  roots  numerically  up  to  R  =  12.  5 
bohr.  The  branch  points  calculated  in  such  a  way  are 
shown  in  Table  I.  We  list  only  those  branch  points  which 
are  directly  related  to  the  convergence  properties  of  the 
perturbation  expansions.  These  are  the  branch  points 
which  are  closest  to  the  origin  and  lie  on  the  Riemann 
sheet  of  7“  (A)  or  7“S1,A(.\)  for  which  7(0)  =0. 


To  complete  the  description  of  our  model  we  report 
the  large  R  asymptotic  expansions  for  two  lowest  en¬ 
ergies  and  for  the  relevant  branch  points  of  7*S(A) 
and  7‘““A(\): 

7  =  _  «-/?-«  .  24R*7 -5£  FT* -OCR'10) 

iFF*  [Hi  -HJ/T* -OtR-1)]  ,  (22) 

ReA“=l-RVl*[$f$S3*Ol/r')]  .  123) 
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♦  Im  \ 


X 


RsX 

l-K/e,  1  1-K/e0 

r:c.  2.  The  location  of  the  branch  points  determining  the  con- 
ergence  properties  of  the  RS.  SRS,  and  MSMA  expansions  at 
M.e  intemuclear  distance  R  *  12.  a  bohr.  The  crosses  denote 
;B.  the  empty  dots  xif34*,  corresponding  to  the  ungerade  sym¬ 
metry  forcing,  and  the  black  dots  xl®2*  corresponding  to  the 
.erade  symmetry  forcing.  The  empty  triangle  denotes  the 
.'.ystcal  value  of  the  expansion  parameter  X  =  1.  The  dashed 
::rcle  has  a  radius  K/(.;,  where  K  is  the  exchange  energy,  and 
as  oeen  drawn  around  the  point  X  *  l. 

Imxf  ,  (24) 

xy“A.l 

tOiR"*)]  .  (25) 

The  MSMA  branch  points  given  above  correspond  to  the 
.nterpolation  function  obtained  by  forcing  the  ungerade 
symmetry  of  the  wave  function.  The  branch  points  cor- 
:esponding  to  gerade  symmetry  forcing  can  be  obtained 
rom  Eq.  (25)  by  inverting  the  sign  of  the  exponential 
art.  The  asymptotic  location  of  the  RS  and  MSMA 
'ranch  points  is  shown  in  Fig.  2. 

The  formulas  (22)  —  (25 )  have  been  obtained  perturbative- 
■  v  by  setting  e '*  =  /a  and  expanding  F,  G,  7,  X, ,  and  a>,  as 
;cwer  series  in  4.  Coefficients  in  these  series  are 
regular  or  have  at  most  polar  singularities  at  R  =  «°, 

-iter  expanding  them  in  powers  of  R'1  we  obtain  double 
power  series  (in  u  and  R *')  for  7,  xj3,  and  X?S“A,  the 
.eading  terms  of  which  are  just  presented  in  Eqs.  (22)- 
251.  Since  the  analytical  dependence  of  the  functions 
F  and  G  upon  w,  \,  R,  and  /a  is  extremely  complicated, 

‘.he  formulas  (.22) —(25)  have  to  be  derived  automatically 
by  a  computer.  In  fact  all  derivations  leading  to  Eqs. 

22) — (25),  including  integrations  in  evaluating  matrix 
elements,  matrix  operations  in  Eqs.  (15)— (19),  and  dif¬ 
ferentiation  in  Eq.  (21),  were  performed  automatically 
ov  ;he  REDUCE  program  for  algebraic  manipulations 
xr.d  took  about  1  h  of  CPU  time  of  the  DEC  20/40  com¬ 
puter. 

To  appraise  the  accuracy  of  our  model,  Eq.  (22)  may 
-e  compared  with  the  exact  asymptotic  expansion  for  the 
•nteraction  energy1*: 

<  =  -\-R-*  -fR'* 

xRe'*(- *-R'1  -  ~  H-^O^R'*))  .  (26) 

\e  e  ie  } 

is  seen  that  the  leading  R*4  term  in  the  asymptotic 
expansion  of  the  interaction  energy  is  given  exactly  in 
ur  model  and  the  first  term  in  the  asymptotic  expansion 
f  the  exchange  splitting  is  in  error  by  less  than  2%. 

The  adequacy  of  our  four-state  model  is  also  confirmed 


by  the  fact  that  the  leading  term  in  our  asymptotic  ex¬ 
pansion  for  Imx“  represents  98. 1%  of  the  exact  value 
obtained  by  Whitton  and  Byers  Brown. 12 

We  have  also  calculated  individual  corrections  in  the 
RS,  SRS,  and  .MSMA  perturbation  expansions  for  7. 

Since  the  convergence  pattern  we  found  was  completely 
analogous  to  that  obtained  in  high  accuracy  calculations 
of  Ref.  5,  these  corrections  are  not  reported  here. 

Now  let  us  summarize  the  main  implications  of  the 
exact  solution  presented  above.  First,  we  note  that  for 
all  distances  considered,  and  asymptotically  at  large  R, 
the  convergence  radius  of  the  polarization  expansion 
p*8  =  1x5*1  is  greater  than  unity,  hence  this  expansion  is 
convergent  for  our  model.  It  is  interesting,  however, 
that  at  large  R  we  have  =Rexfs  ~0{R2  e'**),  i.  e.,  the 
convergence  radius  is  determined  by  the  real  part  of 
x".  Thus,  the  estimation  of  Imxf*  alone12  is  not  suf¬ 
ficient  to  prove  the  convergence  of  the  polarization  ex¬ 
pansion.20  Since  at  large  R  the  convergence  radius  is 
almost  equal  to  unity,  the  polarization  senes  must  con¬ 
verge  extremely  slowly,  which  is  really  observed  in 
numerical  calculations. 8  Moreover,  as  can  be  seen  in 
Fig.  2,  the  physical  value  of  the  expansion  parameter 
X  =  1  lies  at  large  R  almost  exactly  on  the  line  joining 
the  branch  points.  According  to  Nuttall’s  results, 21  in 
such  a  situation  the  Pade  approximants  cannot  be  ex¬ 
pected  to  improve  the  convergence  of  a  perturbation 
series. 

Now  let  us  consider  the  symmetrized  polarization  ex¬ 
pansion.  Since  the  poles  of  7!RS(X)  are  far  outside  the 
unit  circle  and  the  branch  points  of  7SR3(X)  and  7RS(x)  are 
the  same,  we  have  pSR*  =pRS,  i.e. ,  the  SRS  expansion  is 
convergent,  independently  of  the  symmetry  forced  by  A. 
However,  cf.  Fig.  1,  if  A  forces  the  ungerade  symmetry, 
the  symmetrized  polarization  converges  to  some  un¬ 
physical,  “apparent  interaction  energy”  resulting  from 
removing  the  singularity  at  a  =  1.  The  actual  value  of  the 
interaction  energy  lies  on  the  second  branch  of  7SB3(x) 
resulting  from  that  branch  of  c(x)  which  has  ungerade 
symmetry  at  x  =1.  Although  this  second  branch  also 
passes  through  zero,  its  power  series  at  x  =  0  can  be  ob¬ 
tained  only  from  an  excited -state  perturbation  treatment. 
As  will  be  shown  in  the  next  section  a  proper  analytical 
continuation  procedure  enables  us  to  calculate  the  true 
value  of  the  interaction  energy  from  the  perturbation 
series  converging  to  a  spurious  one. 

The  exact  solution  of  our  model  shows  that  the  "ap¬ 
parent'1  and  true  interaction  energies  are  very  close, 
even  at  small  R.  At  large  R,  as  a  consequence  of 
Ahlrichs’  theory14  both  these  energies  must  have  the 
same  R*1  expansion.  What  is  more,  the  difference  of 
these  energies  appears  to  vanish  faster  than  the  ex¬ 
change  energy  itself.  It  must  be  noted,  however,  that 
between  \=0  and  X  =1  the  two  lowest  branches  of  7SM(X) 
are  generally  very  well  separated  and  their  difference 
does  not  vanish  at  large  R. 

The  most  striking  feature  of  the  MSMA  interpolation 
function  is  the  presence  of  real  branch  points  in  the 
vicinity  of  A  =  l.  If  A  forces  ungerade  symmetry,  these 
branch  points  lie  to  the  left  of  the  point  A  =  1  and  produce 
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a  characteristic  gap  in  the  real  values  of  «*““a(a).  Ob¬ 
viously,  between  the  branch  points  the  function 
takes  on  complex  values,  which  are  not  shown  in  Fig. 

1.  The  MSMA  series  cannot  be  convergent  in  this  case 
and,  indeed,  it  diverges  quickly  to  infinity  at  small  R. 

At  large  R,  the  branch  points  lie  very  close  to  A  =  1  and 
this  divergence  is  extremely  slow.  Note  that  the  point 
A  =  l  always  lies  outside  the  line  joining  the  branch  points, 
see  Fig.  2.  One  can  expect,  then,  that  the  Pade  ap- 
proximants  to  7““a(a)  will  converge  at  A  =  1.**  This  is 
really  observed  in  practical  calculations. 7  One  may 
wonder  how  a  Pade  approximant,  which  is  a  single-valued 
function,  selects  the  right  value  of  the  multivalued  func¬ 
tion  <“*“*( a).  In  fact,  the  Pade  approximants  converge 
to  a  single-valued  function  obtained  from  c“sma(a)  by 
introducing  a  cut  between  the  branch  points.22  Along  the 
cut  this  single-valued  function  is  discontinuous  and  the 
Pade  approximants  are  divergent.22 

In  Ref.  7  it  was  noted  that  the  divergent  MSMA  series 
can  be  summed  using  the  geometric  approximation  or, 
in  other  words,  a  sequence  of  [.V/l]  Pade  approximants. 
We  see  now  that  this  can  be  done  only  approximately  and 
the  conjecture7  that  <ms“a(a)  has  a  simple  pole  Close  to 
A  =1  is  not  confirmed.  The  high  accuracy  of  the  geom¬ 
etric  approximation  is  simple  to  understand  in  this  case 
since  a  pair  of  closely  lying  branch  points,  when  viewed 
from  a  distance,  looks  very  much  like  a  simple  pole. 

This  fact  is  well  illustrated  by  the  formula 

v  {a  -  x  rj*  =  (a  -  A)  ±  {rf  -jp*  — ■  +  •  •  •  , 

(27) 

where  the  branch  points  are  at  a  ±  rj  or  a  ±iri  and  rj  is  a 
small  parameter. 

IV.  ANALYTIC  CONTINUATION  OF  INTERPOLATION 
FUNCTIONS 

The  exact  solution  of  our  four -state  model  presented 
in  the  preceding  section  shows  that  the  interpolation 
functions  connect  eigenvalues  of  H( 0)  and  H(  1)  in  a  quite 
nontrivial  way.  Moreover  the  singularities  of  these 
functions  lie  in  the  immediate  neighborhood  of  the 
physical  value  of  the  expansion  parameter,  affecting 
unfavorably  the  convergence  of  the  perturbation  ex¬ 
pansions.  In  practice  what  we  can  calculate  for  real 
systems  are  only  some  low -order  perturbation  cor¬ 
rections,  i.  e.,  some  low-order  derivatives  of  «(a)  at 
A  =0.  In  this  section  we  shall  show  that  this  information 
is  sufficient  for  a  practical  reconstruction  of  <(A)  at 
A  *1.  Such  a  reconstruction  can  be  carried  out  only  if 
we  have  qualitative  information  about  singularities  of 
< (A) .  Therefore,  we  assume,  in  full  accord  with  the  ex¬ 
act  solution  for  the  four -state  model,  that  within  the 
unit  circle  and  its  closest  vicinity  the  interpolation  func¬ 
tions  may  have  only  first-order  branch  points.  The 
power  series  for  <(A)  can  then  be  continued  analytically 
in  this  region  using  the  algebraic  function  of  the  form 

{iV,  A/}  (A)  =p,(A)  *[p2„(A)  +?,(A)]1/2  ,  (28) 

where  ps[ A)  +px  A  ♦  •  •  •  a"  and  q„{ A)  *qx A 

*  •  •  •  -<?»  a".  The  polynomials  p*( a)  and  qK( A)  can  be 
uniquely  determined,  requiring  that  the  derivatives  of 


{iV,  A/}  (a)  taken  at  a  =  0  are  the  same  through  the  (.V  -.1/ 

+  l)th  order  as  the  known  derivatives  of  cU).  This  con¬ 
dition  and  the  identity  \.V,  ,v}2(a)  =2{.V,  a/}  (A)p,(A)  -<7*(A) 
immediately  gives  the  following  system  of  linear  equa¬ 
tions  for  p0  ,  px . P y  And  q0  ,  . . . q„  : 

aunt*,  4V)  i 

2  ^  *7.  «<  «*-,•  for  0  •=««.!/, 

1*0  1.0 

(29) 

V  * 

2  Pi  <».,  =  S  for  .\1  +1  «  ,V*.V  +  1  . 

i>0  i«0 

For  simplicity  we  assume  here  that  .VI»,V.  Since  {.V, 
A/}(A)  is  a  double-valued  function  we  obtain  two  values  of 
{N,  Af}(l)  from  one  series  of  perturbation  corrections 
We  denote  the  larger  value  by  {.V,  A/},  and  the  smaller 
one  by  {.V,  A/}..  For  the  RS  and  MSMA  theories  both 
values  are  expected  to  correspond  to  the  eigenvalues  of 
H(l),  whereas  in  the  case  of  the  SRS  method  one  of  them 
should  correspond  to  a  spurious  interaction  energy. 

The  analytic  continuation  procedure  described  above 
is  a  particular  case  of  a  generalization  of  the  Pade  ap¬ 
proximation  proposed  by  Shafer*  and  referred  to  as 
“quadratic  Pade  approximation."  The  Shafer  approxi¬ 
mants  [ff,  L,  Af](A)  are  defined  as  roots  of  the  general 
quadratic  equation  with  polynomial  coefficients: 

pK{ A)[AT,  L,  A/]2(A)*7i(A)[A,  L,  A/](A)  +ru  (A)  =0  , 

(30) 

the  polynomials  pK( A),  qL( A),  and  r*( A)  being  chosen 
such  that  c(a)  -  [A,  L,  Afj(A)  =0( AA>i''",).  It  is  easy  to 
see  that  {N,  A/} (A )  =  [ O ,  .V,  Af](A).  Besides  first-order 
branch  points  the  Shafer  approximants  may  have  poles 
and  inverse  square  root  singularities.  These  singulari¬ 
ties  are  not  expected  to  be  important  in  our  case,  ex¬ 
cept  possibly  for  the  SRS  method. 

To  test  the  effectiveness  of  the  {S’,  A/}  approximation, 
we  calculate  the  {.V,  .V  -1}  approximants  for  the  RS,  SRS, 
and  MSMA  series  using  highly  accurate  values  of  the  per¬ 
turbation  corrections  obtained  recently  for  the  interaction 
of  a  hydrogen  atom  and  a  proton. 5,5  In  Tables  n-VII  the 
accuracy  of  this  procedure  is  compared  with  the  accuracy 
of  corresponding  sums  of  perturbation  expansions,  and  in 
the  case  of  the  MSMA  method,  with  the  accuracy  of  the 
rational  Pad6  approximation.  The  tabulated  numbers 
show  the  percent  error  resulting  from  the  approximate 
procedure  indicated  in  the  first  row: 

percent  error  -  100(<wtfollroM#  —  tuesm,'  tucuru*  >  '.31) 

where  by  e,ccur,„  we  mean  the  variational  energy  obtained 
with  the  basis  set  used  to  calculate  perturbation  correc¬ 
tions.  This  variational  energy  is  practically  equal  to  the 
exact  energy  of  Table  VI  of  Ref.  5. 

The  branch  points  reported  in  Tables  n-VII  have  been 
obtained  by  solving  equations  py(A,)  *7*. ((A,)  =0  and  esti¬ 
mating  the  limit  ,V-«.  The  effectiveness  of  this  proce¬ 
dure  was  checked  previously  for  the  four -state  model 
where  the  branch  points  could  be  calculated  directly  from 
Eq.  (21). 

It  is  worthwhile  to  note  that  if  we  knew  the  coefficients 
of  the  power  series  expansion  of  <(A)  on  the  second 
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Riemann  sheet,  they  could  be  used  in  calculating  the  poly 
normals  f>*(x)  and  q„(X).  More  specifically,  p„(x)  and 
could  then  be  found  directly  by  performing  the 
power  series  expansion  of 


P(X)*i[l<(X)+l€(X)] 

(32) 

q(\)  =  -  1<(X)3<(X)  . 

(33) 

respectively,  where  't(X)  and  2<(x)  represent  the  two 
branches  of  c(x)  which  branch  in  the  neighborhood  of 
A  =1.  Such  a  method  of  calculating  py(\)  and  ?*(x)  has 
been  recently  suggested  by  Whitton.  w  In  our  opinion 
the  above  method  is  not  particularly  useful  in  calculating 
interaction  energy  since  the  method  requires  an  ex¬ 
cited  state  perturbation  treatment23  for  studying  ground 
state  interactions. 

V.  DISCUSSION  AND  CONCLUSIONS 

The  results  presented  in  Tables  II  and  in  clearly 
show  that,  contrary  to  a  generally  accepted  belief,  the 
standard  RS  polarization  series  can  provide  us  with  the 
information  about  the  accurate  value  of  the  exchange  part 
of  the  interaction  energy.  Moreover,  this  exchange  part 
of  the  interaction  energy  can  be  calculated  not  only  for 
the  state  to  which  the  series  eventually  converges,  but 
also  for  the  other  state  having  the  same  i/R  asymptotic 
expansion  of  the  interaction  energy.  It  should  be  stressed 
here,  however,  that  to  extract  the  exchange  energies 
from  the  polarization  series,  full  knowledge  of  the  charge- 
overlap  effects34  is  necessary.  This  means  that  the  per¬ 
turbation  corrections  must  be  calculated  accurately 
without  using  the  multipole  expansion. 

The  usual  rational  Pade  approximants  are  not  reported 


TABLE  II.  Convergence  of  the  polarization  senes  and  the 
,v,.v  -  1}  approximants  thereto  tor  the  interaction  of  a  hydro¬ 
gen  atom  and  a  proton  at  the  distance  of  R  »  3.  0  bohr.  The 
listed  values  show  the  percent  error  as  defined  in  Eq.  (31). 

The  error  of  e,  ••+«!*. j  and  {.V..V  +  l}_  is  calculated  relative 

to  the  energy  of  the  lso{  state  and  the  error  of  {,V,.V  +  l}.  rela¬ 
tive  to  the  energy  of  the  state.  For  ,V  *  1  and  .V  *  2  the  ap¬ 
proximants  do  not  exist  since  pv ( 1 )  ■‘■q.y.i(l)  is  negative.  The 
empty  places  mean  that  the  error  is  smaller  than  0  00001%. 


V 

C  ,  *•••♦<  ;.*♦  2  * 

{N,.V*l}. 

{N,.V*1}. 

K4 

1 

-  12.  035 

0. 98247 

36.75500 

23.50960 

4 

-4.  039 

-0.37492 

14.30340 

1 3 .  30850 

3 

0.407 

-0.06958 

2.62515 

1.62738 

D 

2.  ISO 

-0.00418 

-0.84357 

-0.53277 

7 

2.  398 

-0.  00254 

-0.59975 

-0.37862 

i 

1.  348 

-0.00080 

-0.09743 

—  0. 06 165 

3 

1 . 070 

0.00004 

-0.  10517 

-0.  06621 

0.  3*5 

0.00005 

-0  07919 

-0.04985 

-0.  333 

-0.00797 

-0.00502 

i-i 

-0.304 

-0.00168 

-0.  00106 

*> 

—  0.  5 

-0.00004 

-0.00003 

;  5 

0.  067 

'The  convergence  radius  of  the  series  equals  1. 09300776  and  is 
Pete  rmined  by  tne  branch  points  xf3*  1.056 16161  * 0 . 28140674!. 
’This  column  shows  the  percent  error  resulting  when  the  ex- 
.nange  energy  K  *<«,-<»)/ 2  is  approximated  by  ({,V,N*l}. 

-  .V..V-  U.)  2. 
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TABLE  HI.  Convergence  of  the  polarization  senes  and  the 
{.V , S  +  l}  approximants  thereto  for  the  interaction  of  a  nydro- 
gen  atom  and  a  proton  at  the  distance  of  R  *  12.  5  bohr.  The 
listed  values  show  the  percent  error  as  defined  in  Eq.  (31). 
The  error  of  t,  ••  and  {V..V  +  l}_  is  calculated  relative 

to  the  energy  of  the  Iso,  state  and  the  error  of  {.V..V  *•  l}.  rela¬ 
tive  to  the  energy  of  the  2p<T„  state.  The  dashes  mean  that  the 
approximants  do  not  exist  since  p$,(l)  +  <?.*,, (1)  is  negative. 

The  {1,2}  and  {2,3}  approximants  are  very  poor  and  are  not 
shown. 


.V 

-..V..V.1}. 

{.V..V  .  l). 

X* 

3 

-  26 .  YOU5 

-  26.0959489 

-2.4254608 

-47  8333651 

4-8 

-  26.  70 

_ 

— 

— 

3 

-26.694 

0.  1435192 

-0.3140393 

0.5420119 

10 

-20.691 

-  0. 0020539 

-0.  0004052 

-  0.  0034696 

12 

—  26. ub7 

-0.0024142 

-0  0052102 

0.  0000209 

14 

-26.683 

-0.000447* 

-0  000965b 

0  0000039 

lb 

-  26. 679 

0  UOOOOOb 

0.0000020 

-  0  0000006 

18 

-  26  675 

-0  0000001 

0  0000002 

-0  0000004 

*The  convergence  radius  of  the  series  equals  1.00000003  and  is 
determined  by  the  branch  points  X33-  l.  00000003*0.  00007550i. 
“See  footnote  b  of  Table  H, 

“The  exchange  energy  represents  26.725%  of  the  interaction 
energy  in  the  gerade  state. 


in  Tables  n  and  III  since  they  are  not  able  to  improve  the 
convergence  of  the  polarization  series.  For  small  R  the 
improvement  is  marginal  and  for  large  R  none.  This  is 
due  to  the  fact  that  at  large  R  the  physical  value  of  the 
expansion  parameter  x  =  1  lies  almost  exactly  on  the 
cut  chosen  by  the  Pade  approximants  to  make  e^U) 
single  valued,  cf.  Fig.  2.  Along  this  cut  the  Pade  ap¬ 
proximants  cluster  their  poles  and  are  not  expected  to 
converge.33  In  fact,  one  of  these  poles  is  located  ex¬ 
actly  in  the  middle  of  the  cut  very  close  to  X  =1.  This 
makes  the  Pade  approximants  numerically  unstable. 

The  results  obtained  for  the  symmetrized  polarization 
series  are  shown  in  Tables  IV  and  V.  Unfortunately,  for 
the  internuclear  distance  12.  5  bohr  we  were  not  able  to 


TABLE  fV.  Convergence  of  the  symmetrized  polarization 
series  sad  the  {V,,V  +  l}  approximants  thereto  for  the  interac¬ 
tion  of  a  hydrogen  atom  and  a  proton  at  the  distance  3.0  bohr. 
The  ungerade  symmetry  is  forced.  The  listed  values  show  the 
percent  error  relative  to  the  accurate  energy  of  the  2p<j„  state, 
see  Eq.  (31). 


.V 

{V..V  +  1}. 

{.V..V  +  1}. 

i 

0.  51196 

0.  06388 

0.  28121 

2 

0.62205 

8.27935 

0.69223 

3 

0.68335 

0.  74671 

0.  03984 

4 

0. 72382 

0.  67561 

0.  26414 

5 

0. 74887 

0. 75109 

-0. 74943 

6 

0.  76233 

1 .  08485 

0. 73836 

7 

0.76778 

0.75872 

0.06621 

S 

0.  76839 

0.  76036 

0.  07705 

9 

0.  76659 

0. 76058 

-0.  00569 

10 

0. 76407 

0.  76057 

0  .  00  925 

12 

0. 76028 

0. 76057 

0. 00021 

14 

0.75935 

0.76057 

0.00075 

16 

0.  75996 

0.  76057 

0.00010 

18 

0-6063 

0. 76057 

o  ooeoi 

*The  convergence  radius  of  the  series  equals  1. 09300776  and  is 
determined  by  the  branch  points  X®*»  1 . 056 16 16 1  t  0. 23 1404741 
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TABLE  V.  Convergence  of  the  symmetrized  polarization  series 
and  of  the  (AT. JV..V]  Shafer  approximates  thereto  for  the  inter¬ 
action  of  a  hydrogen  atom  and  a  proton  of  the  distance  of  12.5 
bohr.  The  ungerade  symmetry  is  forced.  The  listed  values 
show  the  percent  error  relative  to  the  accurate  energy  of  the 
2 per,  state,  see  Eq.  (31).  The  empty  places  indicate  that  the 
error  is  smaller  than  0.000001%.  The  approximants  (1, 1,1) 
and  (2,2,21  are  very  poor  and  are  not  shown. 


I.V..V..VI. 

1N..V..VI. 

3 

-0.01571 

-386.  96818 

-0.016457 

4 

-0.01638 

-  1223.36532 

-0.016453 

5 

-0.01645 

-0.02580 

0. 146426 

6 

-0.01645 

-0.04186 

0.  020906 

7 

-0.01646 

-0.03270 

-0.000175 

s 

-0.01646 

-0.  03291 

0.000036 

9 

-0.  01646 

-0.  032 37 

-0.000004 

10 

-0.01646 

-0.03287 

11 

-0.01647 

-0.03287 

12 

-0.01647 

-0.  03287 

The  convergence  radius  of  the  series  equals  1.00000003  and 
is  determined  by  the  branch  points  Xj*s»  1 . 00000003 
1 0.00007590«. 


attain  a  good  convergence  with  the  {,V,  ,V/}  approximants. 
For  this  reason  we  calculated  the  full  Shafer  (.V,  .V,  .Vj 
approximants.  These  turned  out  to  converge  very  well 
and  are  shown  in  Table  V  instead  of  simpler  {.V,  .V  *l} 
approximants.  The  success  of  the  Shafer  approximants 
might  suggest  that  the  poor  convergence  of  {.V,  .V  *l}  is 
caused  by  some  polar  singularities  of  ^“(X).  Unfortu¬ 
nately,  in  the  case  of  the  four-state  model  of  HJ  we  did 
not  find  such  singularities  in  the  vicinity  of  X  =  1. 

The  results  presented  in  Tables  IV  and  V  convincingly 
show  that  the  SRS  series  converges  to  the  "apparent  in¬ 
teraction  energy"  resulting  from  the  removable  singulari¬ 
ty  at  x  =  1.  In  spite  of  that,  the  actual  value  of  the  inter¬ 
action  energy,  corresponding  to  a  different  Riemann  sheet 
of  «3M(X),  can  be  very  accurately  calculated  by  continu¬ 
ing  the  SRS  series  analytically  using  a  proper  algebraic 
function. 

It  is  worthwhile  to  note  that  at  large  R  the  symmetrized 
polarization  series  converges  to  the  “apparent  interac¬ 
tion  energy”  in  a  very  nonuniform  way.  As  can  be  seen 
in  Table  V  the  series  converges  quickly  only  to  the 
arithmetic  mean  of  the  true  and  "apparent"  interaction 
energies.  Afterwards,  the  convergence  becomes  ex¬ 
tremely  slow  and  the  remaining  part  of  the  "apparent 
interaction  energy”  is  practically  not  recovered.  This 
convergence  pattern  is  quite  similar  to  that  observed  in 
the  RS  theory.  Particular iy,  the  residual  exchange  en¬ 
ergy  in  the  SRS  theory,  noted  in  Ref.  5,  is  an  analog  of 
the  exchange  energy  in  the  RS  theory  and  can  be  rigorous¬ 
ly  defined  as  one-half  of  the  difference  between  the  true 
and  "apparent”  interaction  energies. 

Note  that  the  peculiar  convergence  properties  of  the 
RS  and  SRS  expansions  are  due  to  the  fact  that  the  phys¬ 
ical  value  of  the  expansion  parameter  lies  almost 
exactly  between  the  branch  points.  This  can  be  seen  in 
the  following  way.  Since  at  large  R  the  relevant  branch 
points  of  <(x)  [e(x)  stands  here  for  <M(x)  and  <*",(x))be- 
have  as  1  tin,  where  n  >  0  decreases  exponentially  with 


increasing  R,  the  sum  of  series  (6)  can  be  represented 
as 

£  e„x"  =/(X)+g(X)  Hi  .  (34) 

w  1 

The  functions /(x)  and^(x)  are  assumed  to  be  analytic 
within  a  circle  of  the  radius  much  greater  than  p"5  and 
can  be  expanded  as  power  series  converging  quickly  for 
X  =  1.  Assuming  that^(l)<0  the  two  values  of  «(1)  can 
now  be  written  as  «(1)  =/(l)*  Igtt),  where  the  minus  sign 
corresponds  to  the  sum  of  the  series  in  (34). 

Applying  Eq.  (27)  and  expanding  the  rhs  of  Eq.  (34)  as 
a  power  series  in  x  we  easily  find  that 

€„  =/,  +g*  -gn-t  £  gk ,  (35) 

fcwO 

where  f„  and  g„  are  the  coefficients  in  the  power  series 
expansions  of  /(x)  and .g(x),  respectively.  Formula  (35) 
shows  that  both  the  polanzationand  symmetrized  polariza¬ 
tion  expansion  can  be  thought  of  as  a  sum  of  two  series. 
The  first,  summing  only/,  +g„  -g^i ,  converges  quickly 
to  r (1 ),  i.e. ,  to  the  arithmetic  mean  of  the  two  lowest 
values  of  c(l).  The  second  one,  converging  to  the  ex¬ 
change  energy  K  -  7jg(l)  is  built  up  of  terms  which  for 
large  n  and  R  behave  as  q2g(l)  2.  i.e. ,  which  are 
practically  constant.  The  number  of  such  terms  neces¬ 
sary  to  recover  A'  would  have  to  be  greater  than  2jj‘1, 
i.e.,  would  have  to  increase  exponentially  with  R. 

If  n  is  large  enough  that  the  series  /„  ~g„  -,g„.i  has 
already  converged  to  its  limit,  then  c„  *  rfa( 2)2,  and, 
consequently  «,  =  rjA'  2.  We  found  numerically  that  at 
R  =12.  5  bohr,  e„  =  -0. 132408 x  10'*  hartree  for  22  *  n 
«  38  (see  also  Ref.  6).  Using  the  exact  value  of  A 
=  -0.  34891  x  10"4  hartree  we  find  that  ij  =0. 75898  x  1CT4 , 
which  agrees  very  well  with  the  value  given  in  Table 
III.  It  may  also  be  shown  that  g'(l)  =  -  2  »0(R’M,  in¬ 
dependently  of  the  dimension  of  the  model  used.  There¬ 
fore,  at  large  R  we  have  c„  =  -  A'2.  This  is  a  remarkable 
result.  It  shows  that  the  exchange  energy,  and  conse¬ 
quently  the  frequency  of  the  resonance  tunneling  between 
two  Coulomb  wells,  can  be  estimated  if  we  know  the 
large  n  behavior  of  the  nth-order  energy  in  the  polariza¬ 
tion  series. 

As  we  have  already  mentioned,  the  usual  Pade  ap¬ 
proximants  are  not  effective  in  coping  with  the  con¬ 
vergence  defects  of  the  polarization  expansion.  The 
contrary  is  a  case  in  the  MSMA  theory,  where  the  point 
x  -  1  lies  well  outside  the  line  joining  the  branch  points, 
cf.  Fig.  2,  and  the  divergence  of  the  senes  is  very 
effectively  circumvented  by  the  usual  Pade  approximation, 
see  Tables  VI  and  VII.  It  may  be  checked  that  the  [.V 
.V  - 1  j  approximants  converge  to  that  value  of  eMSM*ll) 
which  can  be  reached  from  X  =0  without  passing  between 
branchpoints,  i.e..  without  crossing  the  cut  that  the 
Pade  approximation  introduces  to  mane  g“s“A(X)  single 
valued.  Since  the  {.V,  .V  -1}  approximants  are  double¬ 
valued  functions  they  can  converge  to  both  values  ol  t(T>. 
It  turns  out,  however,  that,  whereas  the  convergence  to 
the  energy  of  the  ungerade  state  is  excellent,  the  con¬ 
vergence  to  the  energy  of  the  gerade  state  is  very  poor 
and  the  corresponding  approximants  are  not  shown  ir.  the 
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■uies.  This  asymmetry  is  perhaps  not  very  surprising 
■.ce  tn  the  very  definition  of  «“SMA(X)  the  ungerade  syra- 
t?;ry  forcing  was  employed. 


At  large  R  the  divergence  of  the  MSMA  expansion  is  ex- 
emely  slow  and  very  accurate  values  of  the  interaction 
•Argy  can  be  obtained  in  a  low -order  treatment.  Par- 
.ularty,  at  R  =  12.5  bohr  the  interaction  energy  can  be 
;..ulated  in  such  away  with  the  error  of  only  -0.00409%. 

.s  can  easily  be  understand  when  we  know  the  relevant 
.anch  points  of  e“SUA(X).  Since  at  large  R  these  branch 
..its  are  l  -  q* *  <5,  where  q  is  the  same  as  in  the  RS 
r.eory  and  5  «  q,  the  sum  of  the  MSMA  series  can  be 
presented  as 


V  ^S“A.\"  =  s (x)  +t(\)  V(1  -  ti-  X)2  -  S2  ,  (36) 

here  I X!  <  p*s“A  =  1  —  rj  —  5  and  we  assume  that  the  sin¬ 
gularities  of  s(x)  and  f(x)  are  far  outside  the  unit  circle. 
Continuing  the  rhs  of  Eqs.  (36)  analytically  to  the  point 
=1.  we  find  that  the  interaction  energy  in  the  ungerade 
state  can  now  be  written  as 

e=s  (l)-f(l)vbs  - 5*  3>s(l)-<(l  Jr)-*-  t(  1)  .  (37) 

Note,  that  we  had  to  take  a  minus  sign  for  the  square 
root  in  Eq.  (36)  because  f(x)  is  assumed  to  be  negative 
tor  0  «  X  « 1.  Since  6  is  very  small  we  can  employ  Eq. 
37)  and  write 


7A3LE  VI.  Convergence  of  the  MSMA  series  and  of  the  Padfi 
..".a  -.V,.V  *  t}  approximants  thereto  for  the  interaction  of  a  hy- 
lrogen  atom  and  a  proton  at  the  distance  3.  0  bohr.  The  un- 
.erade  symmetry'  is  forced.  The  listed  values  show  the  per- 
.er.t  error  relative  to  the  accurate  energy  of  the  2pcru  state, 
ice  Eq.  132).  The  dashes  mean  that  the  approximants  do  not 
xist  since  p>U)  *<jv.,(l)  is  negative. 


(.Y,.V  +  l)b 

yV,.V  *  1}C 

1 

0.40 

0.  2657364 

0. 232S721 

2 

0.  73 

-  1.7685817 

— 

3 

1.23 

0. 1348294 

— 

-r 

2.07 

0.0032573 

0.0058634 

J 

3.  48 

0.0027985 

0.0000389 

'> 

5.  36 

0.0004636 

-0.0000056 

- 

9.37 

0.0000085 

-0.  0000187 

16.  61 

0.0000073 

0.0000004 

J 

27.  98 

-0.0000020 

0. 0000001 

47.  13 

0. 0000006 

-:-.e  convergence  radius  of  the  series  equals  0.  73833870  and 
is  determined  by  the  smaller  of  the  branch  points  Xjf®** 

*0.  “7:54393*0.01320523. 

’Tie  ;.Y  X/|(X)  Pad4  approximant  to  <0  *<(X)  is  defined  as  a 
.totient  of  two  polynomials  pjyiX)/<j^(X)  chosen  such  that 
,'./|(X)  -t(X)  *0(X'W*').  The  listed  values  of  (JV/.V+  1| 

ire  slightly  different  from  those  published  in  Ref.  7.  The 
:  :ason  (or  this  is  that  in  Ref.  7  a  different  value  for  <0  was 
assumed  and  ( S/S  *  11  -«0  is  not  invariant  under  the  choice  of 

T  ,r  large  .V  the  energies  of  the  iso,  and  2 per,  states  are  ap- 
.  oxiraated  by  the  {.V..V  *  l}.  and  {.V ,.V  ♦  1/.,  respectively. 

.ten  \  is  small,  however,  {.V ,.V  ♦  l).  may  jump  to  the  upper 
ranch  of  and  y.V,.V*l}.  may  be  meaningless.  The 

.moers  listed  for  .V  »  1  and  .V  »4  are  just  (.V..V  *  l).  approxt- 
■■'.unts. 


TABLE  VII.  Convergence  of  me  MSMA  series  and  of  the  PadS 
and  {.V..V  *  l}  approximants  thereto  for  the  interaction  ol  a  hy¬ 
drogen  atom  and  a  proton  at  the  distance  12.  5  bohr.  The  un¬ 
gerade  symmetry  is  forced.  The  listed  values  show  the  percent 
error  relative  to  (he  exact  energy  of  the  2 poy  state,  see 
Eq.  (31). 


.V 

(.V-.V-Il0 

v V..V  * 

1 

0.  1S7200 

0.3214601 

0. 1554056 

•> 

0.033580 

-0.  3033939 

0.  0081292 

3 

0. 003557 

-0.  0033884 

-  0. 0036560 

4 

-  0. 002485 

-0.0041473 

-0. 004U344 

5 

-0.003747 

-0.0040897 

-0. 0040902 

6 

-0.  004016 

—  0. 0040884 

-  0. 0040933 

7 

-0.004074 

-0.  0043096 

-0.0037224 

8 

-0.  004088 

-0.0017780 

-0.01  10923 

9 

-  0. 004091 

-0.0003365 

0. 0000047 

10 

-0.004092 

-  0. 0000031 

-0.0000002 

“The  convergence  radius  of  the  series  equals  0.  9999231s  and 
is  determined  by  the  smaller  ol  the  branch  points  X*34* 

=  0. 9999240914  *0,  0000009055. 

“See  footnote  b  to  Table  VI. 

“For  large  S  the  energies  ot  the  lsar  and  2pca  states  are  approx¬ 
imated  by  {.V..V  +  l}.  and  {.V..V  -  l),,  respectively.  This  isthecase 
here  only  for  ,Y>12.  When  is  smaller  than  12,  {.V,  »v-  If. 
jumps  to  the  upper  branch  of  t“alA(X)  and  {.V..V  *  l}.  becomes 
meaningless.  As  a  matter  ol  fact,  all  numbers  listed  here 
are  {.V..V  *  l}_  approximants,  except  those  calculated  for  .V  =3 
and  .V  *4. 


r  <f“AX"=s(X)*/(X)(l  _  77- A) 62 

n«l 


f(x) 

1  -  q-X 


t{\) 

(1  -  q-X)3 


(38) 


where  we  have  chosen  the  plus  sign  for  the  square  root 
in  Eq.  (27)  to  stay  at  X  =  0  on  the  proper  Riemann  sheet 
of  c*51** (x).  Expanding  the  rhs  of  Eq.  (38)  about  x  *0  we 
get 


(39) 


where  s.  and  t„  are  the  coefficients  in  the  power  series 
expansion  of  s(x)  and  f(X),  respectively.  We  now  see  that 
the  MSMA  expansion  is  a  sum  of  two  series.  The  first, 
summing  s,  -  t„  -  -  nt„,  converges  quickly  to  9(1)  -  7/(1) 

and  the  second,  consisting  of  almost  constant  terms  of  the 
order  s2,  diverges  very  slowly  to  infinity.  Comparing 
this  with  Eq.  (37),  we  see  that  the  sum  of  the  first 
series  differs  from  the  exact  energy  by  ,52f(l)/(2qt. 

Thus,  the  fraction  of  the  interaction  energy  not  re¬ 
covered  is  -  5:A7(2cq2),  where  K  is  the  exchange  energy 
A'a  qf(  1).  Using  the  values  of  q  and  5  from  the  footnote 
of  Table  VII  we  find  that  this  fraction  is  -0.00408%,  in 
very  good  agreement  with  the  numerically  observed 
values. 


In  view  of  expression  (36)’s  success  in  explaining  (he 
convergence  properties  of  perturbation  expansions,  one 
may  wonder  why  this  expression  was  not  used  in  the 
numerical  analytic  continuation  instead  of  an  apparently 
more  complicated  formula  (28).  The  reason  for  this  is 
that  the  system  of  algebraic  equations  determining  s„ , 


J.  Chem.  PhvJ..  Vol.  23.  No.  12.  ’5  December  '980 


6224  Jeziorski,  Schwalm,  and  Szalewicz:  Analytic  continuation  in  perturbation  theory 

,  7,  and  a  is  highly  nonlinear  and  cannot  be  solved  in  <b.  Jeziorski  and  W.  Kotos,  Int.  J.  Quant.  Chem.  12,  Suppl. 
an  easy  way.  l,  91  U9TT). 

’B.  Jeziorski,  K.  Szalewicz,  and  G.  Chalasinski,  Int.  J. 

The  above  discussion  plainly  shows  how  important  Quant.  Chem.  14,  271  (1978). 

a  precise  information  about  positions  of  singularities  of  'g.  Chatasirtski,  b.  Jeziorski,  and  K.  Szalewicz,  Int.  J. 
the  interpolation  functions  is  for  an  understanding  of  the  Quant.  Chem.  11,  247  U977). 

convergence  properties  of  a  perturbation  theory.  The  B-  Jeziorski,  K.  Szalewicz,  and  M.  Jaszuhski,  Chem.  Phys. 


large  R  convergence  problems  of  all  perturbation  ex¬ 
pansions  considered  in  this  paper  are  due  to  the  presence 
of  branch  points  in  the  closest  vicinity  of  the  physical 
value  of  the  expansion  parameter.  The  specific  conver¬ 
gence  pattern,  however,  depends  critically  on  the 
orientation  of  the  branch  points  relative  to  the  point  X  =1. 
On  the  other  hand,  our  results  clearly  demonstrate  how 
useful  any  qualitative  information  about  singularities  of 
the  interpolation  functions  may  be.  Having  such  informa¬ 
tion  we  can  apply  a  proper  analytic  continuation  technique 
and  thus  effectively  cope  with  the  divergence  or  patho¬ 
logically  slow  convergence  of  perturbation  expansions. 
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The  lotal  electrostatic  energy  of  a  charge  distribution,  periodic  in  two  dimensions,  which  may 
consist  of  both  point  charges  and  a  continuous  distribution,  is  analyzed  in  detail.  From  the 
long-range  nature  of  the  Coulombic  force,  singularities  arise  which  must  be  treated  with  consid¬ 
erable  care.  The  Fourier-transform  method  developed  by  Harris  and  Monkhorst  is  carried  for¬ 
ward.  and  the  way  in  which  these  singularities  cancel  is  clearly  demonstrated.  A  convenient  pro' 
cedure  for  computing  total  energies  is  given,  and  various  applications  to  periodic  arrays  of  point 
charges  are  presented. 


I.  INTRODUCTION 

In  a  review  article  Harris'  has  outlined  the 
Fourier-transform  method  for  computing  classical 
Madelung  sums,  and  for  performing  Hartree-Fock 
calculations  on  periodic,  extended  systems  such  as 
crystals.  In  particular,  a  connection  has  been  made 
between  the  internal  structure  of  the  Coulombic 
singularities  in  such  calculations,  which  cancel  at  the 
origin  of  the  transform  domain,  and  the  multipole- 
moment  densities  of  the  repeating  units  which  can 
lead  to  surface  terms  in  the  real-space  continuum 
limit.  Here  we  wish  to  present  the  details  of  an  appli¬ 
cation  of  the  method  to  the  electrostatics  of  a  system 
with  two-dimensional  periodicity. 

The  types  of  charge  configurations  we  have  in 
mind  consist  of  film  arrays  of  nuclear  point  charges 
and  continuous  charge  distributions  arising  from 
delocalized  valence  and/or  conduction  electrons.  As 
a  special  case  we  will  consider  Dirac  8-function  local¬ 
ized  electrons  which  results  in  the  classical 
Madelung-type  problem  of  point  ions  localized  as  an 
array  periodic  in  two  dimensions.  No  limitations  are 
imposed  on  the  charge  distributions  in  the  perpendic¬ 
ular  dimension  in  any  way.  As  we  will  show,  this  ap¬ 
plication  can  be  effectively  formulated  for  fully  gen¬ 
eral  symmetries,  compositions,  and  structure  in  the 
perpendicular  direction.  Most  significantly,  the  ex¬ 
pressions  are  numerically  well  behaved,  in  the  sense 
that  reciprocal-lattice  summations  converge  rapidly  or 
can  sometimes  be  summed  analytically.  This  is  par¬ 
ticularly  true  for  the  point-ion  lattice. 

The  paper  starts  with  a  definition  of  all  the  quanti¬ 
ties  in  the  problem.  Subsequently  the  nuclear- 
nuclear  repulsion  energy  is  partitioned  into  terms  that 
are  treated  analytically  in  different  ways.  Then  fol¬ 
lows  the  Fourier  representation  of  the  electron- 
nuclear  and  eletron-electron  energies.  We  show  how 
singularities  cancel  and  what  the  role  is  of  any  dipole 
moment  of  the  repeating  unit.  Next  we  present  a 
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brief  outline  of  the  significance  of  the  results  in  per- 
foriming  self-consistent  calculations.  This  is  followed 
by  the  speical  case  of  a  S-function  localization  of  the 
electron  distribution  (leading  to  point-ionic  films) 
and  numerical  examples  demonstrating  the  efficacy 
and  flexibility  of  the  method.  Finally  we  present  in 
the  Appendix  a  summary  of  the  working  formulas  as 
derived  in  the  main  text. 


II.  LATTICE  AND  RECIPROCAL-LATTICE  VECTORS 

Consider  a  two-dimensional  lattice  spanned  by  the 
vectors  h,  and  h2.  An  arbitrary  lattice  vector  is 

■«  hi |  hi  +  w2h2  .  ( I ) 

The  index  m  stands  for  the  pair  of  integers  in,,  m2. 
The  systems  to  be  treated  below  must  have  the  trans¬ 
lation  symmetry  of  this  lattice,  bur  will  not  be  re¬ 
quired  to  have  any  particular  symmetry  in  the  other 
spatial  direction.  Assume,  however,  that  these  sys¬ 
tems  can  be  divided  into  layers  in  some  meaningful 
way.  and  let  n  be  an  integer  index  identifying  the 
layer.  Fixing  in,.  m2,  and  n  will  not  in  general  be 
enough  to  address  a  particular  atomic  site,  since  on  a 
particular  layer  and  within  a  given  period  there  may 
be  several  atoms,  each  at  a  different,  nonprimitive 
displacement.  Let  r  specify  the  particular  atomic  site; 
then  the  complete  position  of  an  atomic  site  is  given 
by 

R«.r“  ffw  f  R.r  ■  (2) 

The  nuclear  charge  associated  with  this  particular  site 
will  be  denoted  by  Z„,. 

It  is  convenient  to  define  also  the  reciprocal  lattice 
at  this  time.  The  three-dimensional  basis  is  chosen 
to  be  hi,  h2,  i.  where  :  is  the  unit  vector  perpendicu¬ 
lar  to  the  layer.  The  period  area  is  thus 

A()-i  •  <  hi  x  h2)  .  (3) 
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Reciprocal  basis  vectors  are,  as  usual. 

.  (4a) 

■4o 

kj-  —  (£xh,)  •  (4b) 

Ho 

A  basis  in  reciprocal  space  is  thus  provided  by  k|.  k2,  i. 


III.  PARTITIONING  OF  THE  LATTICE  REPULSIVE 
ENERGY  U 


Assuming  a  Coulombic  interaction  among  a  collec¬ 
tion  of  bare  nuclei,  we  see  that  the  energy  required  to 
assemble  this  lattice  of  positive  point  charges  is 


U~-  T  '  — 

U  I  n 


mmr 
m  n'r' 


R«»f —  R_ 


(5) 


The  prime  on  the  summation  sign  means  that  the 
term  (mi»r)«(m'»V)  is  to  be  omitted.  Even  with 
the  self-energy  term  omitted,  it  is  clear  that  the  infin¬ 


ite,  formal  summation  of  Eq.  (5)  cannot  converge. 
The  lattice  energy  of  a  bare,  positive  lattice  would  in 
fact  diverge  if  it  were  not  dressed  in  a  negative  elec¬ 
tronic  charge  distribution.  We  must  isolate  exactly 
this  divergent  part  of  the  summation  and  cancel  it  to¬ 
gether  with  the  divergent,  repulsive  part  of  the 
electron-electron  interaction,  against  the  divergent, 
attractive  electron-lattice  interaction.  Part  of  the  pur¬ 
pose  of  the  current  paper  is  to  isolate  these  diver¬ 
gences  and  arrange  for  their  cancellation  in  an  analyt¬ 
ical  way.  Thus  the  remaining  energy  contributions 
will  be  well  behaved  and  may  conveniently  be  dealt 
with  numerically. 

Following  the  development  of  Harris,1  we  intro¬ 
duce  the  Fourier  repesentation 


where  the  angle  integration  is  to  be  performed  first. 
We  should  like  to  insert  this  expression  into  Eq.  (5) 
and  interchange  summation  and  integration.  If  this  is 
done  carefully,  we  have 


U-  1  f  1 

2(2ir3)  •'!“!>«  k3 

2  '  Z„Z>vexp(-ik  •  (R^-R^y)] 

mnr 

m  it  r 

d3k  +  U, 


where  the  extra  term  which  has  been  plucked  from  the  region  around  the  origin  at  k  -  0  is 


(7) 


2'Z„Z.,f_,  -Vexpl-k- (K.MT-R  ,  ,  . 
*  2(  2ir* )  "rJ  lxl<«*: 


)  I  d3k 


(8) 


The  reason  for  this  exclusion  can  be  seen  by  examination  of  the  nature  of  the  summands  in  Eq.  (7).  The  sum 
does  not  converge  uniformly  in  k  for  k  in  the  vicinity  of  0.  Thus,  had  we  not  excluded  a  ball  of  radius  t  around 
k  -0,  the  interchange  of  summation  and  integration  would  not  have  been  possible. 

It  will  be  useful  for  the  sake  of  normalization  to  temporarily  suppose  that  the  range  of  summation  of  in i  and 
/»:  in  Eqs.  (7)  and  (8)  is  finite,  and  that  the  total  number  of  periods  is  where  A/  is  a  very  large  integer.  It  is 
then  possible  to  take  advantage  of  periodicity  and  write 


4irJ 


f  - 

J  I  k  i  >  «  k  2 

2  z3. 

2«*P<-'  k  ■  R„  )  -  1 

Ht 

1  m 

2exp(-/  k  •  R„) 


2'Z„rZ  .  .  expl /  k  •  (R„f-  R  .  .)! 


d3k 


(9) 


The  first  term  in  the  integrand  of  Eq.  (9)  accounts  for  the  repulsive  energy  between  sites  on  the  same  sublamce. 
i.e..  sites  with  the  same  n.  r  values.  The  second  term  is  the  intersublattice  repulsion. 

With  the  introduction  of  the  nuclear  structure  factor 


s  (  k  )  —  2  Z,Texp(  i  k  •  R„,)  . 

»T 

we  can  write 


U-U.  +  ~  f_  -\ 

4irJ  ■''*  >•  k2 


|s(k)l:  Jexpl-'k  Rm )  -  2Z,:t 


d3k 


(10) 
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between  the  harmonic  series  and  (he  logarithm. 
irM1 


Ux~ 


Aaki 


2z.\ 


ki 

-.cosr 


(25) 


where  F}  depends  on  the  geometry  of  the  reciprocal 
lattice  through  the  ratio  of  the  lengths  of  the  vectors 
k|  and  k2.  and  through  the  cosine  of  the  angle  y 
between  them 


(26) 


_  1 

A  o 

r  dlk  n 

-.cosy 

mk  2 

2  T 

/mOO 

1  -7“ 

*  M 

Fi 

If  we  express  ku  according  to 
k„-uki+vkj  , 


(27) 


according  to  Eq.  (14),  and  define 
f  (u,  v)  —  k  2(  k  ;u2  +  2kr  k;u  v  +  k]  v:)~'n 

“(a:u!  +  2a/uv+v;)'l/1  .  (28) 


where  a  is  the  ratio  k,/k:  and  /  is  cosy,  the  form  of 
Fi  then  becomes 


FAa.t ) 


2  /  ( /|  J:>  -  f  /  (  u.  v)  du  dv 
tma  J 


(29) 


The  difference  between  the  sum  and  the  integral  of 
any  function  f  (u.v)  over  a  simple  region  can  be 
evaluated  numerically  via  an  Euler-Maclaurin  formu¬ 
la1 


2  2/ /  /(u.  \j)du  dv- 2ft  J '  I  /o.,(«.d)-/a.,(u.c  )]du  +  2*.  v)  - q{<2.  V  )  ]  d  V 


where  i.j  range  over  all  odd,  positive  integers, 

/,j(iy,  u)  -  — -/  (u,  u)  ,  (31) 

du'M 

and  the  coefficients  ft  is  related  to  the  Bernoulli 
number  Bl+,  via 

ft -S, ♦,/(<  +  !)!  .  (32) 

In  the  double  summation  on  the  left-hand  side  of  Eq. 
(30),  boundary  points  are  to  be  given  half-weight. 
Thus,  for  example,  the  contribution  /  (/|,/j)  is  to  be 
mutliplied  by  7  if  /  is  a  corner  of  the  region  of  sum¬ 


)-fIJ(a.d)-flj(b.c)+f<J(a.c))  .  (30) 


mation  by  y  if  I  is  an  edge,  and  by  1  if  it  is  an  interi¬ 
or  point. 

To  evaluate  F we  divide  the  infinite  region  into 
nine  subregions  as  shown  in  Fig.  1.  Region  /  is  de¬ 
fined  by  restricting  the  range  of  /i  and  /-  to  be  from 
-L  to  L  for  some  Hxed  integer  L.  The  integral  of 
/  ( u,  u )  over  region  l  can  be  done  in  closed  form  us¬ 
ing  polar  coordinates.  Thus,  the  contribution  to  F} 
from  the  inner  region  /  is  evaluated  explicitly,  while 
the  Euler-Maclaurin  method  is  applied  to  the  infinite 
regions.  Taking  symmetries  of  /  fu,v)  into  account 
we  deduce  an  expression  for  F2(a,r): 


fj(or.r)- 

/</ 


In 


at  +  l  +  (a2  +  2a/  +  1  )I/J 

+  —  In 

at  —  1  +  (a2  -  2 at  +  1  )l/! 

a 

( a2  —  2 at  -f  1  )’^2  +  at  —  i 


(a2  +  2a/  +  1  ),/2  —  a  —  / 


1 

a 


JL 

(  n 

( a3L 3 2a/E  u  +  v3) 1/3  +  aZ.  +tv 

dv 

m 

la'L1  -  2atL  v  +  +fv 

-2  2ft 


9 

In 

9  u 

(aV  +  hxtuL  -*•  L!)'°  +  L  ->-atu  . 

7  j  '  '  ■,  J  .  ; ,  1/2  r  ,  I  2  2d'kj  i  (  L.Z.  )  f,tJ(  L,  ~~L  )  1 

( a w  —  2aiuL  +  L  )  "  —  L  +  atu  ^ 


(33) 


The  double  prime  on  the  summation  on  I  over  region  I  indicates  that  points  on  the  edge  are  to  be  weighted  by  y, 
those  on  the  corners  by  y  and  the  point  /  -0.0  is  to  be  skipped  altogether. 

Althougn  it  would  certainly  oy  possible  to  perform  the  differentiations  appearing  in  Eq.  (33)  explicitly,  this  has 
not  been  done.  Instead,  forward  tabular  difference  derivatives  have  been  used  and  the  size  of  L  and  the  number 
of  derivative  terms  included  in  the  expansion  adjusted  to  minimize  the  combined  roundoff  and  truncation  error. 

F >  is  thus  computed  as  in  Eq.  (33)  and  if,  results  from  substituting  this  into  Eq.  (25). 
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The  final  partitioning  of  the  V  expression  into 
computationally  convenient  pieces  depends  upon  the 
properties  of  the  remaining  m  summation  in  Eq.  (II). 
In  order  to  avoid  confusion  later,  we  must  proceed 
circumspectly.  Basically  we  have 

2exp(-.k-R.)--^  2S(k„-^)  .  (12) 

•  '■>0  / 


where 

k„-:x(kx-)-|T-f(f-lc)  (13) 

is  the  component  of  k  parallel  to  the  plane. 

R,-/ik,  .  (14) 

with  / 1  and  /;  integers,  is  an  arbitrary,  reciprocal- 
lattice  vector.  The  8  symbol  appearing  in  Eq.  (12)  is 
not  Dirac's  8.  but  rather  it  is  a  peaked  function  of 
finite  width  which  depends  on  Xf.  the  range  of  sum¬ 


mation.  In  fact. 

8(k,)-8w(h|- k)8*<  hr  k)  . 

with 


8v  (.<r )  — 


sin ( (  M  +y  )jx  1 


The  function  8(  k,i )  peaks  at  k,i-0  and  has  its  first 
zero  on  the  perimeter  paraliogram  the  reciprocal  area 
of  which  is  4irV/4„.  From  this  point  on.  we  suppose 
that  the  excluded  region  of  radius  t  is  small  com¬ 
pared  to  the  reciprocal-lattice  spacing,  but  that 

«  ire2  «  —  .  (17) 

M2A  o  4o 

so  that  the  central  peak  remains  tightly  within  the  re¬ 
gion  of  integration  of  U,. 

Thus,  we  arrive  at 


V/2 

i„2  r-  |s(K,  +  k,:  )|2 

Hi  _  X  7  2  )  f  diL 

4ir2  cr  )|* 

4  rr2 

4o  i -on  K2  +  k2 

k‘  ~Z"T  J  k2 

40  0.m  kt 

The  slash  through  the  final  integral  sign  indicates  that  the  range  -«  to  «  is  to  be  excluded.  The  exclusion  of 
the  ball  of  radius  «  has  been  omitted  from  the  second  integration  in  Eq.  (18)  since  the  integrand  is  continuous 
there.  Regrouping  terms  yields 

U-Ut  +  Ui  +  Ut  +  U,  .  (1 


mML  r  I—  y  1  f  d2k" 

'  4„2  4o  .Too^rk2  J  *i?+A,; 


7  dk,  , 


W2  r-  expf»(K,f-R  ..)•  ( K, -t- Axf )  1 

“'-t:  - -K fk- - *■  • 


.v/2  cr  I* 


.  .u1  F 


1 - dk,  . 


The  k,  integration  can  easily  be  done  in  Eqs.  (20) 
and  (21)  giving 


-  ~  2- 

* n  f^oo  1 

and 

,,  _  rr  W2  y- 

Z,rZ  ■  . 

x*  *  f 

L'- - j  2. 

'll  nr 

tmw  A  f 

*  r 

xexpliK, 

-|R,|  |£-(ff„-i?iiV)|] 

(24) 


It  is  quite  clear  that  U3  of  Eq.  (22)  diverges  as 
«—0.  This  is  the  divergent  term  which  must  be  bal¬ 
anced  against  the  divergent,  electron-lattice  attractive 
energy,  as  noted  above. 

We  now  evaluate  the  terms  of  Eq.  (19)  in  detail. 


IV.  EVALUATION  OF  (/, 

As  presented  in  Eq.  (23).  U\  is  the  difference 
between  a  divergent  sum  and  a  divergent  integral.  In 
this  sense  it  is  rather  like  a  two-dimensional  generali¬ 
zation2  of  Euler's  constant,  which  is  the  difference 
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FIG  1  Summ.uion  regions  for  Euler- VtcLaurm  summa¬ 
tion!..  /  is  the  inner  region  over  which  sums  and  integrals 
.ire  explicitly  performed.  In  regions  2.  4.  5.  and  7  the  one- 
Jimensional  Euler- McLaurin  formula,  together  with  a  finite 
summation  is  used  In  regions  I.  3.  6  and  8  the  iwo- 
Jimcnsional  Euler-McLaurm  formula  is  applied.  The  case  if 
lustrated  is  that  of  t«i. 


V.  EVALUATION  OF  Uj 

Whenever  in  Eq.  124)  it  is  the  case  that 

t£„-ff  .  ,)■:  *0  .  (34) 

then  the  I  summation  converges  exponentially.  As  a 
result,  there  is  no  need  for  further  reduction  of  the 
expression  and  Eq.  (24)  may  be  used  directly.  How¬ 
ever.  there  will  be  cases  when  there  is  more  than  one 
inequivalent  sue  in  the  same  plane,  and  then  Eq. 

<341  does  not  hold.  Thus  a  more  generally  conver¬ 
gent  expression  is  desirable. 


Define 

P(jfi.jrj.>“.ar/ ) 


_  V  /'*'/?■  expf-v (a*’/,-’  -i-  2ar/,/; 

4-/2  )l/!l 

1  *  00  l  a  ‘  /  f  *t*  2  a  tl  i  l  2  +  / 

;  )‘« 

(35) 

with  a  and  r  again  being  k  jk ,  and  the  cosine  of  the 
angle  between  k,  and  k;.  and  with 

Xi  »  k|  •  (  R,r  —  R^.f. ' 

<  36a) 

,x.- kv  <R„-R  .  .)  . 

•  ■  X  f 

(36b) 

and 

(37) 

Thus  P  depends  upon  the  atomic  positions  and  shape 
of  the  period  but  not  on  the  size.  In  terms  of  P 

Z'Z„Z  .  .P(.r„a-;..,  ;a.r)  . 

^  0*  ?  nr 

n  r 

(38) 

P  must  now  be  reduced.  First. 

P(.T,..t;.>  ;a./ )  •  Pi  ( x ;..v  )  4-  p.t  x ,.Xj.y\a.i  ' 

.  (39) 

where 


2 

1 2  *n 


expt  |/;.V;  ) 

i/Ti 


e  • 


(401 


and 


P:(x,.x2.\  ;a.r  >  —  V 


"  e  expt  —  f  ( a:l f  4-  2arf|/:  +•  /;  )l/:l 


2_ 


(  a' 1 1  4-  2af/|/;  +  / 2  >'^J 


(41 1 


is  closely  related  to  a  geometric  series  and  can  be  summed  exactly  while  P:  may  be  transformed  by  means  of 
the  Poisson  summation  formula.4  The  results  are 

P;  (  r:.>' )  * -Inlcoshy -cos*; >  4- v  —  In2  .  (42) 

m  m 

P:l  (i.rij  .o.l  )*4  J  2  (cosl  (ai  —  x>at  —  2rrmor  In  I  A',)!o(  I  —  r  ) l/*  |(  r:  a-  2ir/n  )-  4-  y:  )l,: ))  .  143) 


K„  in  Eq.  (43)  is  a  modified  Bessel  function.  U:  is  evaluated  by  using  Eqs.  (42)  and  (43)  for  the  P's  of  Eq.  (38). 
Equation  (43)  is  now  exponentially  convergent  unless  .r  and  are  both  zero,  in  which  case  k  :  is  interchanged 
with  k  , . 


VI.  CANCELLATION  OF  SINGULARITIES 

The  two  remaining  terms  in  the  expression  for  the  lattice  repuslive  energy  U,  namely.  t'i  and  L',.  contain 
divergences  which  will  now  be  shown  to  balance  against  one-half  of  the  electron-lattice  interaction  energy 
U,  may  be  rewritten  without  the  exclusion  of  the  finite  number  of  terms  for  which  and  R..  -  r  ■ 

since  each  of  these  vanishes  linearly  with  t 
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U 


.--^2  2Z„2  At  expt-1  k  •  ( +  R„ - R  .  - )  1  dlk 


(44) 


Doing  the  angle  integration  yields 


z„Z'V £ Joik IK.  +  K„.,l ) dk  . 


m  nr 
n  r 


where 

j0(x)  —  sinx/jr 
and 

R.„v-R.r-R  -  •  • 


(45) 

(46) 

(47) 


come  from  terms  for  which  |R„|  is  of  the  order  of 
R„,x.  It  is  clear  that  for  such  summands 

«lff.-KMiiVt  »«*«.»  I  .  (51) 

the  latter  inequality  following  from  Eq.  (17).  Thus, 
using  the  above  results,  each  significantly  contribut¬ 
ing  integral  is  very  nearly  n/Rmxx,  and  the  number  of 
such  terms  is  clearly  proportional  to  M2.  So,  again 
using  Eq.  (17), 

u,ae  (M2HM2)(ir/Rm,x)  ~  M}  .  (52) 


Consider  now  the  integral 


/  ( a,  € )  *  f 

Jo 

i 

j0(ax)dx  . 

(48) 

/(a.  *)  -  — 
a 

f  sina<Jfdx  . 

Jo  x 

(49) 

If  at  is  much  larger  than  ir.  then  /  becomes  approxi¬ 
mately  ir/a,  while  if  at  is  much  less  than  ir,  I  is  t. 
From  this  it  is  seen  that  one  can  estimate  the  size  of 
the  summands  in  Eq.  (45).  M  is  the  limit  of  the 
magnitude  of  m ,  and  in  j,  thus  the  magnitude  of 
|R_  —  R  .  .1  is  bounded  by  a  number  in  the  order 

nrn  r 

of 

(50) 

But.  from  the  discussion  resulting  in  the  restriction 
Eq.  (  17),  most  of  the  contributions  to  the  summation 


Therefore  this  term  also  is  divergent. 

The  divergence  of  U„  which  may  be  called  an 
inner  divergence,  since  it  occurs  near  the  origin  in  re¬ 
ciprocal  space,  will  be  balanced  against  an  exactly 
similar  term  in  the  electron-lattice  attraction.  Like¬ 
wise,  another  term  will  appear  in  the  electron-lattice 
attraction  which  will  have  a  divergence  with  exactly 
the  form  of  that  noted  in  Eq.  (22).  These  two  diver¬ 
gences  must  also  be  permuted  to  cancel. 

The  expression  for  the  total  electron-lattice  energy 
is 

(K)--Jp(T)  2-  %—  -d2r  ■  (53) 

mm r  I  T  Rfimrl 

Here,  p(7)  is  the  total  electronic  number  density,  so 
that  the  minus  sign  is  treated  explicitly.  Again  using 
Eq.  (6) 


f  -r.  ,  1  f  exp(-/k-(T-Stm„)l  ,,, 

<K)~J  p(7)l Z"  J - T2 - d  k 

once  more,  cautiously  interchanging  the  sum  and  the  integral 

00“<>0.+  00,  . 


dlr  ; 


where 


00.- 

1 

fp(7)  2Z„  ( 

IWIf 

f  expi-ik-  (7-R„.,)l  Jlf 

2irJ  • 

«nTi<« 

u 

and 

<>0.- 

1 

~  In2  - 

[  e-,T  Tp(7)d3r|  d>k 

(54) 

(55) 

(56) 

(57) 


Equation  (57)  appears  to  contain  the  Fourier  integral  transform  of  the  number  density  p(7).  Care  is  again  re¬ 
quired,  since  this  integral  does  not  exist.  We  require  two  properties  of  the  electron  density:  (i)  p(T)  is  periodic 
over  the  lattice  defined  by  Eq.  (1);  (ii)  the  integral  of  p(7)  over  one  period  is  exactly  Q  where 

<?-2*.»  (58) 
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Thus,  if  the  size  of  the  lattice  is  again  temporarily  re- 
-tncted, 

J p(T)e-'7  Td}r~Mip(  k)  .  (59) 


lattice  repulsive  energy,  one  has 


-fj:. 


5  ( k,z  ) 

k,} 


•vnere  p(  k )  is  an  electronic  structure  factor  defined 

oy 


x  Is  ( -k,z  )  -  p(k:i  ) )  dk,  . 


p(k)~  f  If  p(7)e“' k  T  dx  dy\dz  .  (60) 

**  —m  yj  period  1 

Chen  Eq.  (57)  for  {V),  divides  again  into  two  parts: 

(f'),-<k')n  +  (l')i  ■  (61) 

» an 

•  ~^r. 

I  s  (Kf  +  k,z  )p(  R,+k,:  ) 


X  1 

in  oo 


K,!  +  k,1 


dk,  . 
(62) 

The  completeness  relation  Eq.  (12)  has  been  used  to 
produce  Eq.  (62).  Likewise 

OOj - —s(ktz)p(kti)dkx  (63) 

■40  J  ~~  k,‘ 


fomparing  Eq.  (63)  with  the  Uj  contribution  Tor  the 


(64) 

The  singularity  as  the  excluded  region  from  -t  to  c 
(denoted  by  the  bar)  tends  to  zero  has  now  been  can¬ 
celed,  for  s  and  p  are  analytic  for  all  real  k,  and  since 
they  are  each  equal  to  Q  of  Eq.  (58)  as  k  —0,  their 
difference  has  at  least  a  simple  zero  at  k  -5.  Thus 
Eq.  (64)  has  a  finite,  Cauchy  principal  value 

limU/j  +  y^bJ-tconsOM2  •  (65) 

i— o  1 


Expression  (62)  for  ( t/)u,  which  is  of  a  form 
similar  to  that  of  U}.  is  perfectly  well  behaved  and 
needs  no  further  reduction.  The  method  of  produc¬ 
ing  the  electronic  structure  (actor  will  be  presented 
subsequently. 

The  combination  of  U,  and  -y  ( V) ,  must  now  be 
examined.  After  performing  the  angle  integration  in 
Eq.  (56)  and  taking  periodicity  into  account 


<»">.-  2  Z"'  X.noa  p  ( 7 1  X*  1  R"  +  R-  '  7  1  ’  ^  dy  d: 


(66) 


which  is  very  similar  in  form  to  Eq.  (45)  for  U,.  The  important  tool  for  evaluating  both  expressions  is  the 
faylor-series  expansion1 


f'j0(k\R+?l)dk-±. 


Ra~  3(Ra  )*-R1a1  ,  ,  r.,  (Ra)2 

SK«*)+  - - jp -  lstn( «/?  )  -SiUR  )] - jp- 


sin(«R  )  ,  „  . 

- - - <  costtfl  ) 

+  0 

a 4 

R 

(67) 


where  Si(x)  is  the  sine  integral  function  defined  by 

si<x)-  r— ^  • 

•'O  f 

Applying  this  to  Eq.  (66)  and  keeping  only  the  terms  which  can  contribute  at  large  Rm 

+  R"  '  (R;ff~ "  !sin(  tRm )  -  Si(tfl„  )  1  4- 


(68) 


(69) 


■Mth  Q  given  by  Eq.  (58)  and  F-  being  the  coordinate  within  a  cell  of  the  centroid  of  the  electron  distribution 


Likewise,  treating  U,  in  a  similar  way,  the  result  is 


(70) 
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(/,-  +  —  XlZ.r 

^  m  nr 


-0-SUtRm)  + 
Rm 


R„  - (ff„,(?- P,)  , 

- ry -  [sin(e*„)-Si(  «*„)]  + 


(71) 


with  P+  the  centroid  of  the  nuclear  charge  distribu¬ 
tion. 


p*-  y  z ,  ,r  , .  . 

it  r  nr 


Therefore,  the  sum  for  small  t 


(72) 


y.+T<0.-—  IC 


R«-(P.-PJ 


against  similar  singularities  in  y(P)  has  thus  been 
demonstrated.  Without  the  vanishing  of  the  parallel 
component  of  the  total  dipole  moment,  the  long- 
range  terms  in  Eq.  (73)  become  difficult  to  handle. 

Since  it  is  very  reasonable  to  ask  what  happens  to 
the  other  half  of  the  singularity  in  <  P),  the  answer  is 
provided  here  as  follows:  the  other  half  of  the  ( P) 
divergence  cancels  against  a  similar  divergence  in  the 
electron-electron  repulsive  energy  C 


x  [sin(«/?„ )  -  Sit  tRm )  ]  .  (73) 

One  sees  that  if  the  total  dipole  moment  of  the  cell 
has  no  component  in  the  plane  of  the  lattice,  then 
there  is  strict  cancellation  and  the  sum  of  these  two 
terms  is  zero  in  the  limit  «  —  0.  Under  this  condi¬ 
tion.  the  exact  cancellation  of  all  singularities  in  U 


c-i  r  . 

•  J  iT-T'l 


(74) 


Proceeding  as  in  the  treatment  of  (  P),  translation 
symmetry  produces 


C-C.+C  . 

where 


with 


C, 


and 


rj 

rr  ~  period 

p<H£ 

-c,2  +  c, 

■ 

_  m1  r- 

T  - 

(k,z  +K, 

~  A  o  J  — 

/  *00 

k:2  +  K,2 

M2  p* 

’  aq'J— 

\p(k,z) 

\2 

—  dk,  . 

dk,  , 


(75) 

(76) 
(77-) 

(78) 

(79) 


Cu  is  finite  and  needs  no  further  reduction.  Applying  Eq.  (67)  to  Eq.  (76)  for  C,  yields  for  the  nonvanishing 
part 


c..ti£i2r /  ,m 

TT  -  J-mJ  period^ 


-^■Si i(Rm)  4 — - - —  [ sin ( «/?„ )  —  Si( «/?*)!  + 


RJi 


dx  dv  dz  . 


Whence,  for  small  t  one  has  again  as  in  Eq,  (73) 
R„-(P_-PJ 


c,+t<p>.-—  2Q 

*  7T  _ 


(sin(«/?„)-Si(«fl*)]  . 


(80) 

(81) 


which  vanishes,  provided  the  net  dipole  moment  is 
perpendicular  to  the  plane  of  the  lattice. 

The  cancellation  of  the  singularity  in  C j  against 
that  of  y  (  P)j  follows  from  a  discussion  entirely 
similar  to  that  preceding  Eq.  (65).  The  conclusion  in 
this  case  is  that 

lim  ( Cj  +  y  +  (  P)  j)  -  ( const )/Vf!  .  (82) 

«— o 


VII.  TOTAL  ENERGY 

All  of  the  essential  ingredients  are  now  at  hand  to 
express  the  total  energy  of  quantized  systems  with 
translation  symmetry  on  the  two-dimensional  lattice 
defined  by  Eq.  (1).  It  will  be  assumeed  that  the 
results  of  a  self-consistent  electronic  structure  calcu¬ 
lation  are  available  in  the  form  of  band  energies  and 
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Bloch  functions,  and  furthermore  that,  as  a  result  of 
the  self-consistent  procedure,  the  Fermi  energy  £>  is 
also  known.4 

If  there  is  no  spin-orbit  coupling,  each  single¬ 
particle  eigenstate  for  the  >uh  band  satisfying 

tfU.  k.1)-£.(kll)U.  k«>  (83) 

will  be  doubly  occupied  if  the  band  energy  £M(kn)  is 
less  than  £>  and  unoccupied  otherwise. 

The  one-electron  operator  H  may  generally  be  as¬ 
sumed  to  consist  of  kinetic  energy  7,  electron-lattice 
interaction  V,  Hartree-like  electron-electron  energy  J, 
and  possibly  an  exchange  operator  K 

H  -f  +  V  +J  -  K  .  (84) 


exchange  contributions  to  £  in  Eq.  187). 

2£„(  k,|)  QiEf  —  £M(  kj|))  d2ku 

+  U  +  j(T)  +~(V)  .  (91) 

The  U  and  y  (  V)  evaluations  must  be  done  together 
in  a  w$y  similar  to  that  presented  above  in  order  to 
ensure  numerical  cancellation  of  the  singularity.  Oth¬ 
erwise,  these  two  parts  will  provide  large  numbers, 
computed  in  generally  different  ways,  which  then 
must  almost  cancel  to  give  a  finite  result. 


VIII.  DISCUSSION 


Define  the  ground-state  expectation  value  of  an  arbi¬ 
trary  operator  ft  by 

(ft>-2-^r  f  20(£>-£„<  k,,)) 

—  4„- 


*  (p.  kill  ft  l/i.  k||)  d]k n 

with 


0(x) 


0.  if  x  <  0 
1.  if  x  >  0 


(85) 

(86) 


and  the  integration  being  over  any  convenient  period 
of  the  reciprocal  lattice  defined  by  the  vectors  k|,  k2 
of  Eq.  (4).  The  total  energy  of  the  system  is  not  the 
sum  of  the  single-particle  energies,  which  would  be 
the  expectation  value  of  H.  This  is  because  the 
operators  J  and  ( -K  ).  which  depend  in  a  nonlinear 
way  on  the  solution,  contain  an  averaged  electron- 
electron  interaction.  The  expectation  value  (J  —  K) 
would  count  this  interaction  twice,  once  from  the 
point  of  view  of  each  electron.  On  the  one  hand,  the 
total  energy  per  period  is  given  by 

E-(T >  +  <('')  +C+X  +  U  ,  (87) 

where  U  is  the  lattice-lattice  repulsive  energy  as  treat¬ 
ed  above,  and  where 

C-UJ)  (88) 

and 

Af-y(-Af)  ,  (89) 

the  exchange  energy.  On  the  other  hand,  the  sum  of 
the  single-particle  energies  is 

T  — rf  2£„(  kn)6(£f  —  £„( iqi)>  d*kn 

**  Air1  J  ior\t 

* 

-<//>-<r>  +  <i'>  +  <y>  +  <-*)  ■  (90) 


The  present  analysis  of  the  classical  electrostatic 
energy  of  two-dimensionally  periodic  systems  has  led 
to  a  proliferation  of  many  expressions,  some  of 
which  are  quite  involved.  Therefore,  in  order  to  aid 
the  reader  in  implementing  this  formulation  we  have 
summarized  the  definitions  and  the  working  formulas 
in  the  Appendix.  We  have  grouped  terms  in  a 
manner  convenient  for  computation. 

It  is  worthwhile  pointing  out  that  the  determination 
of  p<7)  has  to  be  made  in  some  independent  way. 
such  as  from  Hartree-Fock-type  calculations4  or  from 
some  experiment. 

It  is  also  possible  to  obtain  an  expression  for  the 
electrostatic  potential  at  an  atomic  lattice  site.1  One 
obtains  this  formally  by  the  expression 

4>(R„T)-i|i  .  (92) 

5Z„ 

thus  leading  to  the  potential  at  the  atomic  sites  at  po¬ 
sition  with  respect  to  the  origins  of  the  unit 
periods.  If  one  wishes  to  determine  the  potential  at 
an  arbitrary,  nonlattice  point  R  in  the  slab  one  simply 
adds  this  point  to  the  lattice  summations  with  “nu¬ 
clear”  charge  Z  -0.  and  uses  Eq.  (92).  This  pro¬ 
cedure  is  described  in  Harris  and  Monkhorst5  for 
three-dimensionally  periodic  solids.  We  will  present 
applications  of  this  approach  in  a  forthcoming  paper.” 

Finally,  we  can  specialize  our  formulations  for  the 
case  of  point-charge  distributions  only.  This  can  be 
achieved  by  introducing  the  expression  for  p(7) 

p(7>- 2  v„f5<7  -  R„„t)  ,  (93) 

mar 

i.e.,  a  distribution  of  discrete  number  densities  v„,  at 
nuclear  sites  Rm,r.  But  then  (  V)  and  C  reduce  to 

V„,Z  i  i 

O') - V  _ - ±1 -  ,  (94) 

"»»  ~  R  ■  '.'I 


A  most  convenient  method  of  extracting  the  total  en¬ 
ergy  from  a  self-consistent,  energy-band  calculation  is 
to  use  half  of  Eq.  (90)  to  evaluate  the  Coulomb  and 


mar 


(95) 
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The  primes  to  above  summations  indicate  that  we 
have  left  out  the  (infinite)  self-energy  terms,  just  as 
in  the  expression  for  U.  It  is  easily  seen  that  £a  as¬ 
sumes  the  form 


£o 


i  v 

m* t  ~ 


(96) 


where  — 2,,- v,f  are  the  net  charges  at  sites  R„. 
Indeed,  we  have  obtained  the  expression  for  the 
Coulomb  interaction  between  point  charges  <?„,  local¬ 
ized  at  sites  Comparing  this  results  with  Eq. 

(5)  for  U  we  see  that  the  two  expressions  are  of  the 
same  form,  with  the  charge  neutrality  now  assuming 
the  form 


Sd.r-O  • 

Hr 


(97) 


If  we  introduce  the  lattice  structure  factor 


<t(  k  )  -  ^<7,,  exp(/ k  •  R„,)  .  (98) 

Hr 


we  can  immediately  write  the  expressions  for  £d, 
after  simple  manipulations  starting  from  Eqs.  (22), 
(25),  and  (38): 


£ci  -  £i  +£j  +  £  i 

• 

(99) 

r  _  1 

■4o^t  ] 

nr 

F;(a.l)  . 

(100) 

_.  nM 2 

E} “ 

2  . 

Hr 

(101) 

h  r 

_  rr  M1  v 

£) - —  £  q, 

.r 

'V/lz-  (R.r-R.yil  ■ 

(102) 

nr 


In  Eqs.  (100)  and  (101)  the  arguments  of  the  func¬ 
tions  F;  and  P  are  the  same  as  those  used  in  the  pre¬ 
vious  text  and  the  Appendix. 


IX.  EXAMPLES:  APPLICATION  TO 
POINT-IONIC  FILMS 

Table  I  presents  results  for  the  lattice-structure- 
constant  evaluation  according  to  Eq.  (33)  The  angle 
0  is  measured  either  between  h,,  hj  (not  between 
1?,.  It;).  When  9  is  rr/2,  results  may  be  compared 
with  those  of  Table  I  of  Harris.  Agreement  is  found 
in  every  such  case.  The  unit  cell  formed  in  the  case 
I  h|l  -  1,  I  h2|  -  v2,  and  9  -  rr/4  actually  spans  the 
same  lattice  as  that  spanned  by  I  h,|  -  1,  lh2|  -  1, 

9  —  it/2.  Agreement  is  exact,  although  the  sums  are 
evaluated  differently.  The  time  required  on  the 
University  of  Utah  College  of  Science  DEC  system 
20/60  to  compute  F:  ranges  from  about  0.19  sec 
when  ,V  —  5  to  6.3  sec  when  .V  -  100  and  varies  as 
N*. 

Madelung  constants  are  presented  in  Table  II  for  a 
layer  of  the  NaCI  structure.  The  parameter  : 
represents  the  displacement  of  one  sublattice  relative 
to  the  other  in  the  direction  perpendicular  to  the 
plane.  Normally,  when  the  Evjen  method  is  used  to 
compute  the  Madelung  constant,  the  nearest- 
neighbor  distance  is  set  to  I.  This  means  that  the 
nearest  neighbor  on  the  same  sublattice  is  at  a  dis¬ 
tance  of  \/2.  When  this  is  taken  into  account,  the 
results  of  the  two  calculations  agree. 

As  the  angle  9  between  h,  and  h2  decreases,  the 
Madelung  energy  becomes  more  and  more  negative 
as  the  ions  of  opposite  sign  come  together.  For  lat¬ 
tice  spacing  o.  as  0  —  0.  the  sum  correctly  tends  to 
the  one-dimensional  limit 

fci-~ -  ■  (103) 

a  0 

When  one  sublattice  is  displaced  relative  to  the  other, 
the  layer  is  separated  into  oppositely  charged  sheets. 
Thus  the  energy  begins  to  increase,  and  work  must 
be  done  as  :  becomes  larger.  The  increase  in  the  E 3 


TABLE  I.  Lattice  structure  constants. 


lh|l.  lhjl.0 

a.t 

,v 

F j(a./ ) 

l.  1.  it/2 

1.0 

5 

-3  900169466 

1.  1.  it/ 2 

1.0 

10 

-3  900  250  582 

l.l.tr/2 

1.0 

50 

-3.900  264  802 

1.  1.  nil 

1.0 

100 

-3.900  264  905 

1.  l.ir/2 

1.0 

150 

-3  900  264916 

1.  1.  nil 

1.0 

200 

-3  900264918 

1.  Si.  it/4 

V2.  I/V2 

100 

-3  900  264  905 

1.  1,0.40* 

1.0.30902 

100 

-4011  7104 

1.  1. 0,45* 

1.0  15643 

100 

-3.927  982  0 

1.  1.0.50* 

1.0 

100 

-3  900 2649 

1.  1.  it/3 

1.  VI/2 

100 

-4.2134226 

I.V8/3.  it/2 

V8A3.0 

100 

-2.926679  1 
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T  xBLE  II.  Madelung  energy  of  distorted  NaCI  layer  I mtx  and  ±  Jmxx  are  ranges  of  «  and  m 
summations  for  Eq.  143) 


ih,  i.  thjl.:.  9 

£cl  value 

Time  in  seconds 
on  DEC  system  20/60 

v2.  vl. 0.  ir/2 

5.5.5 

-1.61547 

0.118 

v  2.  v  2.  0.  rr/2 

25.5.5 

-1.61554 

0  482 

1.  1.0.  ir/2 

25.5.5 

-2.284  72 

0.482 

1  1 .  i).  0  45  it 

25.5.5 

-2.328  45 

0  479 

1.1.0.  0.40* 

25.5.5 

-2.463  38 

0  476 

1  1 . 0.  0  35rr 

25.5.5 

-2.702  192 

0  488 

1.  1.0.0.30* 

25.5.5 

-3  071  875 

0  476 

1.  1.0.  ir/4 

25.5.5 

-3.627  312 

0482 

1.  l.0.l.ir/2 

25,5.5 

-2  203  007 

0  481 

1.  1.0.2.  tr/2 

25.5.5 

-1.967  742 

0  478 

1.  1.  0  3.  ir/2 

25.5.5 

-1  604450 

0  476  ■ 

1.  1.0.4.  */2 

25.5.5 

-1  145  134 

0  478 

1.  1 . 0.5.  tr/2. 

25.5.5 

-0619819 

0  482 

a-0a’*  o-o’"  t-Ti4* 


1  IG  J  Example  of  a  double  layer  of  barium  titanale. 
c  arrows  indicate  the  motion  of  the  positive  ions  under 
e  distortion  :  ol  Table  III  Notice  that  witen  -0  5.  Ti4* 
ns  *  ill  coincide  with  0-“ 


term  is  responsible  for  most  of  this  energy 
As  a  final  example,  consider  two  layers  of  the  bari¬ 
um  titanate  structure  as  illustrated  in  Fig.  2.  Ti'4*' 
ions  are  located  at  the  body  centers  of  each  cube, 
while  Ba12*'  ions  are  at  ihe  face  corners.  O'2”’  ions 
are  at  the  face  centers.  In  all.  there  are  two  Ti14*’. 
two  Ba'24-',  and  six  O'2"'  ions  or  a  total  of  ten  ions 
The  total  electrostatic  energy  in  this  configuration 
is  -134.1 17,  in  units  of  electronic  charge  squared  di¬ 
vided  by  the  distance  between  neighboring  barium 
ions.  Table  III  shows  the  variation  in  the  contribu¬ 
tions  to  this  energy  and  to  the  total  as  the  charge  ar¬ 
rangement  is  distorted.  The  distortion  is  one  in 
which  the  positive  ions  move  up  together  in  the 
direction  perpendicular  to  the  base  plane  of  the  thin 
film,  while  the  negative  ions  remain  fixed.  Such  a 
displacement  would  be  associated  with  a  ferroelectric 
transition  in  the  bulk.  Distortion  is  also  measured  in 
units  of  the  lattice  parameters. 

Originally,  the  energy  goes  up  with  the  displace¬ 
ment,  as  it  did  in  the  NaCl  case,  but  eventually  it 
would  diverge  in  the  negative  direction  by  the  time 
:  —0  5.  since  at  that  point,  the  Ti'4*'  ions  would 
touch  the  O'2*'  ions.  Thus,  there  is  a  maximum  in 
the  electrostatic  energy  of  about  -109  near  :  -0  3 
A  better  way  to  treat  the  electrostatics  of  systems 
of  point  charges  would  be  to  present  potential  coeffi¬ 
cients  such  that  the  total  energy  is  expressed 

as 

"04) 
*  V 

This  is  planned  to  be  done  for  both  thin  films  and 
semi-infinite  geometries  m  a  subsequent  publication  “ 
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TABLE  111.  Madelung  energy  for  double  layer  of  BaTiOj.  In  each  case,  £,  -  —  171.61 16,  and 
time  —  IS  sec  on  DEC  system  20/60. 


Distortion  : 

Et 

Ei 

Eet-E,  +£j+£j 

0.00 

37.4944 

0.0000 

-134  1171 

0.05 

21.1070 

17.5929 

-132.9117 

0.10 

6.9702 

35.1858 

-129.4555 

0.15 

-5.4024 

52.7788 

-124.2353 

0.20 

-16.8694 

70.3717 

-118.1093 

0.25 

-28.8401 

87.9646 

-112.4871 

0.30 

-43.8401 

105.5575 

-109,9024 

0.35 

-67.4202 

123.1504 

-115.8814 
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APPENDIX 

We  will  summarize  the  working  formulas  for  the 
evaluation  of  the  classical  electrostatic  part  of  the  to¬ 
tal  energy  for  a  film  of  atoms  with  two-dimensional 
periodicity. 

Geometry  definitions 

(1)  The  two-dimensional  lattice  is  spanned  by  the 
vectors  hi  and  h2.  An  arbitrary  lattice  vector  is 

R„  —  r«|  hi  +  m2h2  ,  (Al) 

where  the  index  m  stands  for  the  pair  of  indices 
(m,.m2),  each  ranging  over  M  values. 

(2)  n  is  the  layer  index  and  r  specifies  a  nonprimi¬ 
tive  displacement  for  an  atomic  site.  Then  the  com¬ 
plete  specification  of  an  atomic  site  is  given  by 

it ""  R-  7*  .  (A2) 

(3)  The  nuclear  charge  at  the  site  R„m,  is  denoted 
by  Z|T. 

(4)  The  period  area  is  denoted  by  -40,  and  it  is 
given  by 

40"i  •  (  h|  x  h2)  ,  <A3) 

with  :  the  unit  vector  perpendicular  to  the  layer  plane 
spanned  by  I K*  ). 

(5)  Reciprocal  basis  vectors  are 

k,-i3L{h2xf)  .  <A4a) 

Aq 

k,-l2L(i  x  h,)  . 

A  o 


Reciprocal-lattice  vectors  are  denoted  by 

K,-/,k,  +/2k2  .  ( A 5 ) 

(6)  The  electronic  number  distribution  throughout 
the  film  is  denoted  by  p( 7).  This  quantitiy  is  period¬ 
ic  in  the  lattice  vector 

p(7  +R„)  — p(T)  .  (A6) 

Its  Fourier  transform  is  expressed 

p(k  )  "  J* p(T)f*'~'T  dV  .  ( A 7 ) 

p(7)  satisfies  the  relationship 

J*  p(T)dV-Q  .  (A8) 

p«fKKl 

where  the  period  nuclear  charge  Q  is  defined  as 

.  (A9) 

*T 

(7)  The  centroid  of  the  nuclear  charge  distribution 
P*  is  given  by 

Pf-2Z.fR.r  •  <AI0) 

(8)  The  centroid  of  the  electron  charge  distribution 
P_  is  given  by 

P-  —  f  d:  f  dx  dy  p( 7)7  .  (All) 

(9)  The  nuclear  structure  factor  s<  k)  is  defined  by 

J  <  k  )  -  J  Z,r  exp(<  k  ■  )  .  (A12) 

Partitioning  of  electrosutic  energy 

(1)  The  classical  Coulombic  energy  £c,  arising 
from  the  electrostatic  interaction  among  the  nuclei 
and  electrons  can  be  written  as 

(A  13) 


5** 


(A4b) 


■  "  .—•-"  A-.'"  r;,.;  -*  '•■•  V?.^  -V  ‘  V*  *•  '. "'IgrfrJ 
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(2)  U  is  the  nuclear-nuclear  repulsion  energy  given  (5)  Following  the  an 


Z„Z  .  . 

L’-i  T  _  ■  —  -±-'-— 

«wr  lR-«wr~R _*  *- 


.here  the  prime  to  the  summation  indicates  the 
mission  of  the  self-energy  terms. 

1 3)  (  V )  is  the  electron-nuclear  attraction  energy 
;iven  by 

(l/)m-Jd>rp(7)  2  — T  (Al 

in**  I  r  —  KMf| 

14)  C  is  the  electron-electron  repulsion  energy 
.:iven  by 


(5)  Following  the  analysis  of  the  main  text  we  par¬ 
tition  the  respective  energies  as  follows: 


U  ~  U,  +  U\  +  U}  +  U)  .  (A  17) 

<K>-<K>,  +  <f'>,j  +  <'/)j  ■  *  A 18) 

C-C,  +  C12+Cj  .  (A  19) 

(6)  If  f  x  (  P»-  P_)  -15,  i.e.,  no  net  dipole  mo¬ 
ment  parallel  to  the  R„  plane,  we  have 

U,+  (V)t  +  C.~ 0  .  (A20) 

Otherwise  the  present  method  should  be  modified. 

(7)  (/,  is  given  by 


,,  .JLML  \7'.  f  - 1  k,'-kl 

U:  T7~  l.z*T  -  IT'  L-  V 

nt  *2  *>*2 


. .  ar  +  l-t-(a2  +  2ar  +  D'/3  ,  I  ( a2  -  2al  +  l)l/2  4-  a  -  r 

F,(a.r)-2  /(/,./,) -2£  [in  +  l)«  «  (a2  +  2*/  +  1 -  a  -  / 


-42*. 


3  ‘  (a2L2  +  2atLv  +  v1Vn  +aL  +lv 

6 v  10  {a1L1-2a/Lv  +  v1)'n-aL  +rv  ^L 


-22*.  In  ia'ul  +  l**L+L1Y"  +  L+av  -22 mA/vILD-WL-L 

,  1  bu  {a1!/1  ~2atuL  +  L1)'1  —  L  +  atu\limL  ,j 


u)  — — - -f(u.v)  ,  (A 23) 

bu'bir 

f  ( u.  v )  —  (a2u!  +  2aru  v  +  v2)_,/2  .  (A24) 

e,  +  1 )!  .  <A25)  ' 

The  '  and  y  summations  are  over  odd  integers,  and 
:ne  double  primes  on  the  /  sum  indicate  (a)  deletion 
of  <b)  weights  ({)  on  the  edges  of  the 

mner  summation  region  /  (see  Fig.  I),  (c)  weights 
i  ■*  i  on  the  corners  of  /. 

(8)  U i  is  given  by 

U^lML  T'Z.,2  .  .P<x,.xt.y:a.t)  .  <A26) 

4  o^"  l  *  * 


(A23) 

x,  -  it, 

■(K„~r  ■  ■)  . 

"T  it  r 

( A  27 ) 

X2-  ki 

\  4  l  Rut  “*  R  •  •  ) 

(A28) 

(A24) 

H  t 

(A25)  * 

y-kr 

If  •(R„-RitV)|  . 

( A  29) 

and  F  is  partitioned  according  to 

Pix,.X}.y.a.i )  -  F,(  Jrj.v  )  +  F-lXi.Xj.y  :a.r )  . 

(A30) 

with 

) "  — Intcoshy  —  cosxi ) +y  -  In2  .  (A3I) 


P-(.X\.x\.y.a.i )  —  4  J  2  (cosi  (x,  —  x.ar  —  2irwof  )n  1  Kol  o(  I  —  r2 )  ^‘n  ( (xj  +  2rrm  )2 -t-r' )l/!  I)  .  (A32) 
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(9)  ( K)i2  and  Cu  can  be  most  conveniently  combined  to  give 


/i/\  ^  Cm  ,,  r- 

<l'>u  +  C,j-  —  J  dkt  2,p(K,  +  <t,z)  - Vim - 

Ao  Im 00  l  Kt  +ks 


(10)  U j,  <  K)j,  and  C3  can  be  combined  to  give 
A i1  CT  \s(k,i)-p(k,z)\2 


£/i  +  <K)3+c,-iii4r  ■ 

A  o 


dk,  . 


(A33) 


(A34) 


'Permanent  address:  Physics  Department.  University  of 
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